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PROPTCTIOf? Of SftMMA IiIITOLgWIC AMP 
BY A A6-DESAT0RASE 

1 

This is a continuation-in-part of U.S. Serial 

No. 08/789,936 filed January 28, 1997 which is a 

continuation-in-part of U.S. Serial No. 08/307,382, 

5 filed September 14, 1994 which is a continuation of U.S. 

Serial No. 07/959,952 filed October 13, 1992 which is a 

continuation-in-part of U.S. Serial No. 817,919, filed 

January 8, 1992, which is a continuation-in-part 

application of U.S. Serial No. 774,475 filed October 10, 
10 1991, 

FIELD OF THE INVENTION 

, Linoleic acid (18:2) (LA) is transformed into 

gamma linolenic acid (18:3) (GLA) by the enzyme A6- 
desaturase. When this enzyme, or the nucleic acid 
encoding it, is transferred into LA -producing cells, GLA 
is produced. The present invention provides nucleic 
acids comprising the A6- desaturase gene. More 
specifically, the nucleic acids comprise the promoters, 

20 coding regions and termination regions of the A6- 
desaturase genes. The present invention is further 
directed to recombinant constructions comprising a A6- 
desaturase coding region in functional combination with 
heterologous regulatory sequences. The nucleic acids 

25 and recombinant constructions of the instant invention 
are useful in the production of GLA in transgenic 
organisms . 
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BACXGRQDND OF THE INVENTIQW 

1 Unsaturated fatty acids such as linoleic 

(C^gA^'^^) and a-linolenic {C^^A^'^^'^^) acids are essential 
dietary constituents that cannot be synthesized by 
vertebrates since vertebrate cells can introduce double 
5 bonds at the position of fatty acids but cannot 

introduce additional double bonds between the double 
bond and the methyl - terminus of the fatty acid chain* 
Because they are precursors of other products, linoleic 
and a-linolenic acids are essential fatty acids, and are 
10 usually obtained from plant sources. Linoleic acid can 
be converted by mammals into y-linolenic acid (GLA, 
C^gA^'*'^^) which can in turn be converted to arachidonic 
acid (20:4), a critically important fatty acid since it 
is an essential precursor of most prostaglandins • 

15 The dietary provision of linoleic acid, by 

virtue of its resulting conversion to GLA and 
arachidonic acid, satisfies the dietary need for GLA and 
arachidonic acid. However, a relationship has been 
demonstrated between cons\amption of saturated fats and 

20 health risks such as hypercholesterolemia, 

atherosclerosis and other clinical disorders which 
correlate with susceptibility to coronary disease, while 
the consumption of unsaturated fats has been associated 
with decreased blood cholesterol concentration and 

25 reduced risk of atherosclerosis. The therapeutic 
benefits of dietary GLA may result from GLA being a 
precursor to arachidonic acid and thus subsequently 
contributing to prostaglandin synthesis. Accordingly, 
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consumption of the more unsaturated GLA, rather than 
1 linoleic acid, has potential health benefits. However, 
GLA is not present in virtually any commercially grown 
crop plant. 

Linoleic acid is converted into GLA by the 
5 enzyme A6-desaturase. A6-desaturase, an enzyme of more 
than 350 amino acids, has a membrane - bound domain and an 
active site for desaturation of fatty acids. When this 
enzyme is transferred into cells which endogenously 
produce linoleic acid but not GLA, GLA is produced. The 

10 present invention, by providing genes encoding A6- 
desaturase, allows the production of transgenic 
organisms which contain functional A6 -desaturase and 
which produce GLA. In addition to allowing production 
of large amounts of GLA, the present invention provides 

15 new dietary sources of GLA. 



SPMMftRY OF Tag IWVBNTIQW 

The present invention is directed to isolated 
A6 - desaturase genes. Specifically, the isolated genes 
20 comprise the A6- desaturase promoters, coding regions, 
and termination regions. 

The present invention is further directed to 
expression vectors comprising the A6 -desaturase 
promoter, coding region and termination region. 
25 Yet another aspect of this invention is 

directed to expression vectors comprising a A6- 
desaturase coding region in functional combination with 
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heterologous regulatory regions, i.e. elements not 
1 derived from the A6 -desaturase gene. 

Cells and organisms comprising the vectors of 
the present invention, and progeny of such organisms, 
are also provided by the present invention. 
5 . A further aspect of the present invention 

provides isolated bacterial A6- desaturase. Isolated 
plant A6-desaturases are also provided. 

Yet another aspect of this invention provides 
a method for producing plants with increased gamma 
10 linolenic acid content. 

A method for producing chilling tolerant 
plants is also provided by the present invention. 

BRIEF DESCRIPTION OF THE DRAWINGS i 

15 Fig. 1 depicts the hydropathy profiles of the 

deduced amino acid sequences of Svnechoc vs t i s A6- 
desaturase (Panel A) and a12- desaturase (Panel B) . 
Putative membrane spanning regions are indicated by 
solid bars. Hydrophobic index was calculated for a 

20 window size of 19 amino acid residues [Kyte, et al. 
(1982) J. Molec. Biol. 1571 . 

Fig. 2 provides gas liquid chromatography 
profiles of wild type (Panel A) and transgenic (Panel B) 

25 Fig. 3 is a diagram of maps of cosmid cSy75, 

cSyl3 and Csy? with overlapping regions and subclones. 
The origins of subclones of Csy75, Csy75-3.5 and Csy7 
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are indicated by the dashed diagonal lines. Restriction 
1 sites that have been inactivated are in parentheses. 
Fig. 4 provides gas liquid chromatography 
profiles of wild type (Panel A) and transgenic (Panel B) 
tobacco , 

5 Fig. 5A depicts the DNA sequence of a A6- 

desaturase cDNA isolated from borage. 

Fig. 5B depicts the protein sequence of the 
open reading frame in the isolated borage A6-desaturase 
cDNA. Three amino acid motifs characteristic of 
10 desaturases are indicated and are, in order, lipid box, 
metal box 1, and metal box 2. 

Fig. 6 is a dendrogram showing similarity of 
the borage A6-desaturase to other membrane -bovmd 
desaturases. The amino acid sequence of the borage A6- 
15 desaturase was compared to other known desaturases using 
Gene Works (IntelliGenetics) . N-umerical values 
correlate to relative phylogenetic distances between 
subgroups compared. 

Fig. 7 is a restriction map of 221.A6.NOS and 
20 121.A6.NOS. In 221.A6.NOS, the remaining portion of the 
plasmid is pBI221 and in 121.A6.NOS, the remaining 
portion of the plasmid is pBI121. 

Fig. 8 provides gas liquid chromatography 
profiles of mock transfected (Panel A) and 221.A6.NOS 
25 transfected (Panel B) carrot cells. The positions of 
18:2, 18:3 a, and 18:3 y(GIjA) are indicated. 

Fig. 9 provides gas liquid chromatography 
profiles of an untransf ormed tobacco leaf (Panel A) and 
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a tobacco leaf transformed with 121.A6.NOS* The 
1 positions of 18:2, 18:3 a, 18:3y (GLA) , and 18:4 are 
indicated. 

Fig. 10 is the complete DNA sequence and 
deduced amino acid sequence of evening primrose h6- 
5 desaturase. A heme binding motif of cytochrome b5 
proteins is indicated by underlined bold text. 
Underlined plain text indicates three histine rich 
motifs (HRMs) . The motifs in this sequence are 
identical to those found in borage A6- desaturase with 
10 the exception of those that are italicized (S 161 and L 
374) . 

Fig. 11 is a formatted alignment of the 
evening primrose and borage A6- desaturase amino acid 
sequences . 

15 Fig. 12A is a Kyte-Doolittle hydrophobicity 

plot for borage A6 -desaturase. 

Fig. 12B is a Kyte-Doolittle hydrophobicity 
plot for evening primrose A6 -desaturase. 

Fig. 13A is a Hopwood hyflrophobicity plot for 
20 borage A6- desaturase. The y axis is a normalized 

parameter that estimates hydrophobicity; that the x axis 
represents the linear amino acid sequences^ 

Fig. 13B is a Hopwood hydrophobicity plot for 
evening primrose A6- desaturase. X and y axes are as in 
25 Figure 13 A. 

Fig. 14A graphically depicts the location of 
the transmembrane regions for borage A6- desaturase. 
Positive values (y-axis) greater than 500 are considered 
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significant predictors of a membrane spanning region, 
^The X-axis represents the linear amino acid sequences. 
Fig. 14B graphically depicts the location of 
the transmembrane regions for evening primrose A6- 
desaturase. X and y axes are as in Figure 14A. 

5 

PBTAH^BP PESCRIP T IQ W Q W TH B IWV B WT I QW: 

The present invention provides isolated 
nucleic acids encoding A6-desaturase. To identify a 
nucleic acid encoding A6-desaturase, DNA is isolated 

10 from an organism which produces GLA, Said organism can 
be, for example, an animal cell, certain fungi (e.g. 
Mprti^r^ll^) , certain bacteria (e.g. Synechocystis ) or 
certain plants (borage, Oenothera, currants) . The 
isolation of genomic DNA can be accomplished by a 

15 variety of methods well-known to one of ordinary skill 
in the art, as exemplified by Sambrook ^ jai- (1989) in 

MQl^cular Cloning; A L^pratoxy Manual, coid spring 

Harbor, NY. The isolated DNA is fragmented by physical 
methods or enzymatic digestion and cloned into an 

20 appropriate vector, e.g. a bacteriophage or cosmid 

vector, by any of a variety of well-known methods which 
can be found in references such as Sambrook ^ al , 
(1989) . Expression vectors containing the DNA of the 
present invention are specifically contemplated herein. 
25 DNA encoding A-desaturase can be identified by gain of 
function analysis. The vector containing fragmented DNA 
is transferred, for example by infection, 
transconjugation, transf ection, into a host organism 
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that produces linoleic acid but not GLA. As used 
1 herein, "transformation" refers generally to the 

incorporation of foreign DNA into a host cell. Methods 
for introducing recombinant DNA into a host organism are 
known to one of ordinary skill in the art and can be 
5 found, for example, in Sambrook ^ ^1. (1989) . 

Production of GLA by these organisms (i.e., gain of 
function) is assayed, for example by gas chromatography 
or other methods known to the ordinarily skilled 
artisan. Organisms which are induced to produce GLA, 
10 i.e. have gained function by the introduction of the 
vector, are identified as expressing DNA encoding A- 
desaturase, and said DNA is recovered from the 
organisms. The recovered DNA can again be fragmented, 
cloned with expression vectors, and functionally 
15 assessed by the above procedures to define with more 
particularity the DNA encoding A6 - desaturase . 

As an example of the present invention, random 
DNA is isolated from the cyanobacteria Svnechocy s t i s 
Pasteur Culture Collection (PCC) 6803, American Type 
20 Culture Collection (ATCC) 27184, cloned into a cosmid 
vector, and introduced by transconjugation into the GLA- 
deficient Cyanobacterium Anabaena strain PCC 7120, ATCC 
27893. Production of GLA from Aneibaena linoleic acid is 
monitored by gas chromatography and the corresponding 
25 DNA fragment is isolated. 

The isolated DNA is sequenced by methods well- 
known to one of ordinary skill in the art as found, for 
example, in Sambrook al* (1989) . 
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In accordance with the present invention, DNA 
1 molecules comprising AS-desaturase genes have been 

isolated. More particularly, a 3.588 kilobase (kb) DNA 
comprising a A6-desaturase gene has been isolated from 
the cyanobacteria S vnechocys t i s , The nucleotide 
5 sequence of the 3.588 kb DNA was determined and is shown 
in SEQ ID N0:1. Open reading frames defining potential 
coding regions are present from nucleotide 317 to 1507 
and from nucleotide 2002 to 3081. To define the 
nucleotides responsible for encoding A6 -desaturase, the 
10 3.588 kb fragment that confers A6 -desaturase activity is 
cleaved into two subf ragments, each of which contains 
only one open reading frame. Fragment ORFl contains 
nucleotides 1 through 1704, while fragment 0RF2 contains 
nucleotides 1705 through 3588. Each fragment is 
15 subcloned in both forward and reverse orientations into 
a conjugal expression vector {AM542, Wolk ^ aJ.- [1984] 
Proc. Natl, Acad. Sci, nsA £1, 1561) that contains a 
cyanobacterial carboxylase promoter. The resulting 
constructs (i.e. ORFl(F), ORFl (R) , ORF2 (F) and 0RF2 (R) ] 
20 are conjugated to wild- type Anabaena PCC 7120 by 

standard methods (see, for example, Wolk et al. (1984) 
Proc. Natl, Acad, sci, URA QX, 1561) . Conjugated cells 
of Anabaena are identified as Neo^ green colonies on a 
brown background of dying non- conjugated cells after two 
25 weeks of growth on selective media (standard mineral 
media BGllN + containing 30/2g/ml of neomycin according 
to Rippka et al,, (1979) J, Gen MicrobiOl, ill, D . The 
green colonies are selected and grown in selective 
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liquid media (BGllN + with 15/zg/ml neomycin) * Lipids 
1 are extracted by standard methods (e*g. Dahmer et al., 
(1989) jQurn^l q£ Amgrjtcan Oil ChCTdg^l S<?gjLgtY 543) 
from the resulting transconjugants containing the 
forward and reverse oriented ORFl and 0RF2 constructs, 
5 For comparison, lipids are also extracted from wild- type 
cultures of AnabaeUft and SvngChQCYgtlg > The fatty acid 
methyl esters are analyzed by gas liquid chromatography 
(GLC) , for example with a Tracor-560 gas liquid 
chromatograph equipped with a hydrogen flame ionization 
10 detector and a capillary column. The results of GLC 
analysis are shown in Table 1. 



Table 1: Occurrence of C18 fatty acids in wild- type and 

transgenic cyanobacteria 



20 

















Anabaena 
(wild type) 


+ 








+ 




Anabaena + ORFl (F) 






+ 








Anabaena + ORFl (R) 


+ 




+ 




+ 




Anabaena + ORF2 (F) 




+ 


+ 




+ 


+ 


1 Anabaena + 0RF2 (R) 






+ 




+ 




1 Synechocystis 
i (wild type) 






+ 









25 

As assessed by GLC analysis, GLA deficient 
Anabaena gain the function of GLA production when the 
construct containing 0RF2 in forward orientation is 
introduced by transconjugation. Transconjugants 
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containing constructs with 0RF2 in reverse orientation 
1 to the carboxylase promoter, or ORFl in either 

orientation, show no GLA production* This analysis 
demonstrates that the single open reading frame (0RF2) 
within the 1884 bp fragment encodes A6-desaturase. The 
5 1884 bp fragment is shown as SEQ ID NO:3. This is 
substantiated by the overall similarity of the 
hydropathy profiles between A6 - desaturase and a12- 
desaturase [Wada slL al- (1990) Nature 347] as shown in 
Fig. 1 as (A) and (B) , respectively- 

10 Also in accordance with the present invention, 

a cDNA comprising a A6 -desaturase gene from borage 
(SQrqgQ Qfficin^llg) has been isolated • The nucleotide 
sequence of the 1,685 kilobase (kb) cDNA was determined 
and is shown in Fig. 5A (SEQ ID NO: 4) . The ATG start 

15 codon and stop codon are underlined. The amino acid 

sequence corresponding to the open reading frame in the 
borage delta 6 -desaturase is shown in Fig. 5B (SEQ ID 
NO: 5) , 

Additionally, the present invention provides a 
20 A6- desaturase gene from evening primrose ( Oenothera 

biennis ) . The nucleotide sequence of the 1.687 kb cDNA 
was determined and is depicted in Figure 10 (SEQ ID 
NO: 26) . Also shown in Figure 10 is the deduced amino 
acid sequence 6f evening primrose A6 -desaturase. 
25 Isolated nucleic acids encoding A6- desaturase 

can be identified from other GLA -producing organisms by 
the gain of function analysis described above, or by 
nucleic acid hybridization techniques using the isolated 
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nucleic acid which encodes Synechocvstis , borage, or 
1 evening primrose A6-desaturase as a hybridization probe. 
Both methods are known to the skilled artisan and are 
contemplated by the present invention. The 
hybridization probe can comprise the entire DNA sequence 
5 disclosed as SEQ. ID N0:1 or SEQ. ID NO: 4, or a 
restriction fragment or other DNA fragment thereof, 
including an oligonucleotide probe. Methods for cloning 
homologous genes by cross -hybridization are known to the 
ordinarily skilled artisan and can be found, for 

10 example, in Sambrook (1989) and Beltz ^ sX- (1983) 
Methods in Enzymolpgy lOQ, 266. 

In another method of identifying a delta 6- 
desaturase gene from an organism producing GLA, a cDNA 
library is made from poly-A^ RNA isolated from polysomal 

15 RNA. In order to eliminate hyper -abundant expressed 
genes from the cDNA population, cDNAs or fragments 
thereof corresponding to hyper - abundant cDNAs genes are 
used as hybridization probes to the cDNA library. Non 
hybridizing plaques are excised and the resulting 
20 bacterial colonies are used to inwculate liquid cultures 
and sequenced. For example, as a means of eliminating 
other seed storage protein cDNAs from a cDNA library 
made from borage polysomal RNA, cDNAs corresponding to 
abundantly expressed seed storage proteins are first 
25 hybridized to the cDNA library. The "svibtracted" DNA 

library is then used to generate expressed sequence tags 
(ETSs) and such tags are used to scan a data base such 
as GenBank to identify potential desaturates. 
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Using another method, an evening primrose cDNA 
^ may be isolated by first synthesizing sequences from the 
borage A6 -desaturase gene and then using these sequences 
as primers in a PGR reaction with the evening primrose 
cDNA library serving as template. PGR fragments of 
5 expected size may then be used to screen an evening 
primrose cDNA library. Hybridizing clones may then be 
sequenced and compared to the borage cDNA sequence to 
determine if the hybridizing clone represents an evening 
primrose A6-desatuase gene. 

10 Transgenic organisms which gain the function 

of GLA production by introduction of DNA encoding A6- 
desaturase also gain the function of octadecatetraeonic 
acid (18:4^^'^'"'^^) production. Octadecatetraeonic acid 
is present normally in fish oils and in some plant 

15 species of the BQiTftginacefte family (Craig ^ ai. [1964] 
J. Amer. Oil Chem. Soc, H, 209-211; Gross ^ [1976] 
Can. J. Plant Sci. 659-664). In the transgenic 

organisms of the present invention, octadecatetraenoic 
acid results from further desaturation of a-linolenic 

20 acid by A6 -desaturase or desaturation of GLA by a15- 
desaturase . 

The 359 amino acids encoded by 0RF2, i.e. the 
open reading frame encoding S ynechocvstis A6 -desaturase, 
are shown as SEQ. ID NO: 2. The open reading frame 
25 encoding the borage A6 - desaturase is shown in SEQ ID NO: 
5. The present invention further contemplates other 
nucleotide sequences which encode the amino acids of SEQ 
ID NO: 2 and SEQ ID NO: 5. It is within the ken of the 
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ordinarily skilled artisan to identify such sequences 
^which result, for example, from the degeneracy of the 
genetic code, Purthezitiore, one of ordinary skill in the 
art can determine, by the gain of function analysis 
described hereinabove, smaller subfragments of the 
5 fragments containing the open reading frames which 
encode A6-desaturases. 

The present invention contemplates any such 
polypeptide fragment of A6-desaturase and the nucleic 
acids therefor which retain activity for converting LA 
10 to GLA. 

In another aspect of the present invention, a 
vector containing a nucleic acid of the present 
invention or a smaller fragment containing the promoter, 
coding sequence and termination region of a A6- 

15 desaturase gene is transferred into an organism, for 
example, cyanobacteria, in which the A6 - desaturase 
promoter and termination regions are functional . 
Accordingly, organisms producing recombinant A6- 
desaturase are provided by this invention. Yet another 

20 aspect of this invention provides isolated A6- 

desaturase, which can be purified from the recombinant 
organisms by standard methods of protein purification, 
(For example, see Ausubel oii al. [1987] Current 
Protocols in Molecular Bioloov. Green Publishing 
25 Associates, New York) . 

Vectors containing DNA encoding A6- desaturase 
are also provided by the present invention. It will be 
apparent to one of ordinary skill in the art that 
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appropriate vectors can be constructed to direct the 
expression of the A6-desaturase coding sequence in a 
variety of organisms. Replicable e3q>ression vectors are 
particularly preferred. Replicable expression vectors 
as described herein are DNA or RNA molecules engineered 
5 for controlled expression of a desired gene, i.e- the 
A6-desaturase gene. Preferably the vectors are 
plasmids, bacteriophages, cosmids or viruses. Shuttle 
vectors, e.g. as described by Wolk slL (1984) Proc. 

Na t l t A cad, SQi. U SA. 1561-1565 and Bustos ^ (1991) 
10 J- Bacteriol, 174 . 7525-7533, are also contemplated in 
accordance with the present invention. Sambrook al . 
(1989), Goeddel, ed. (1990) M^thQdS in EnZYmQlQgY 1^5. 
Academic Press, and Perbal (1988) A Pragtical Guide tQ 
Molegular Cloning/ John Wiley and Sons, Inc., provide 
15 detailed reviews of vectors into which a nucleic acid 
encoding the present A6 - desaturase can be inserted and 
expressed. Such vectors also contain nucleic acid 
sequences which can effect expression of nucleic acids 
encoding A6- desaturase. Sequence elements capable of 
20 effecting expression of a gene product include 

promoters, enhancer elements, upstream activating 
sequences, transcription termination signals and 
polyadenylation sites. The upstream 5' untranslated 
region of the evening primrose A6- desaturase gene as 
25 depicted in Figure 10 may also be used. Both 
constitutive and tissue specific promoters are 
contemplated. For transformation of plant cells, the 
cauliflower mosaic virus (CaMV) 35S promoter, other 
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constitutive promoters and promoters which are regulated 
^ during plant seed maturation are of particular interest • 
All such promoter and transcriptional regulatory 
elements, singly or in combination, are contemplated for 
use in the present replicable expression vectors and are 
5 knovm to one of ordinary skill in the art* The CaMV 355 
promoter is described, for example, by Restrepo ^ al , 
{1990) Plant Cell 2, 987. Genetically engineered and 
mutated regulatory sequences are also contemplated. 

The ordinarily skilled artisan can determine 
vectors and regulatory elements suitable for expression 
in a particular host cell. For example, a vector 
comprising the promoter from the gene encoding the 
carboxylase of Anabaena operably linked to the coding 
region of A6-desaturase and further operably linked to a 

15 termination signal from S vnechocy s t i ^ is appropriate for 
expression of A6 - desaturase in cyanobacteria. "OperaJDly 
linked" in this context means that the promoter and 
terminator sequences effectively function to regulate 
transcription. As a further example, a vector 

20 appropriate for expression of A6 -desaturase in 

transgenic plants can comprise a seed- specif ic promoter 
sequence derived from helianthinin, napin, or glycinin 
operably linked to the A6 - desaturase coding region and 
further operably linked to a seed termination signal or 

25 the nopaline synthase termination signal. As a still 
further example, a vector for use in expression of A6- 
desaturase in plants can comprise a constitutive 
promoter or a tissue specific promoter operably linked 
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to the A6 - desaturase coding region and further operably 
1 linked to a constitutive or tissue specific terminator 
or the nopaline synthase termination signal. 

In particular, the helianthinin regulatory 
elements disclosed in applicant's copending U.S. 
5 Application Serial No. 682,354, filed April 8, 1991 and 
incorporated herein by reference, are contemplated as 
promoter elements to direct the expression of the A6- 
desaturases of the present invention. The albiimin 
regulatory elements disclosed in applicant's copending 

10 U.S. application Serial No. 08/831,570 and the oleosin 
regulatory elements disclosed in applicant's copending 
U.S. application Serial No. 08/831, 575 (both applications 
filed April 9, 1997), and incorporated herein by 
reference, are also contemplated as elements to direct 

15 the expression of the A6-desaturases of the present 
invention. 

Modifications of the nucleotide sequences or 
regulatory elements disclosed herein which maintain the 
functions contemplated herein are within the scope of 
20 this invention. Such modifications include insertions, 
substitutions and deletions, and specifically 
substitutions which reflect the degeneracy of the 
genetic code. 

Standard techniques for the construction of 
25 such hybrid vectors are well-known to those of ordinary 
skill in the art and can be found in references such as 
Sambrook fit Ai- (1989) , or any of the myriad of 
laboratory manuals on recombinant DNA technology that 
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are widely available, A variety of strategies are 
^ available for ligating fragments of DNA, the choice of 
which depends on the nature of the termini of the DNA 
fragments. It is further contemplated in accordance 
with the present invention to include in the hybrid 
5 vectors other nucleotide sequence elements which 
facilitate cloning, expression or processing, for 
example sequences encoding signal peptides, a sequence 
encoding iCDEL or related sequence, which is required for 
retention of proteins in the endoplasmic reticulum or 
10 sequences encoding transit peptides which direct A6- 

desaturase to the chloroplast. Such sequences are known 
to one of ordinary skill in the art. An optimized 
transit peptide is described, for example, by Van den 
Broeck ai. (1985) Nature Hi, 358. Prokaryotic and 

15 eukaryotic signal sequences are disclosed, for example, 
by Michaelis ^ (1982) Ann, Rev, Microbiol, 1£, 425. 

A further aspect of the instant invention 
provides organisms other than cyanobacteria or plants 
which contain the DNA encoding the A6 - desaturase of the 

20 present invention. The transgenic organisms 

contemplated in accordance with the present invention 
include bacteria, cyanobacteria, fungi, and plants and 
animals. The isolated DNA of the present invention can 
be introduced into the host by methods known in the art, 

25 for example infection, transf ection, transformation or 
transconjugation. Techniques for transferring the DNA 
of the present invention into such organisms are widely 
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known and provided in references such as Sambrook fit 
1 (1989) . 

A variety of plant transformation methods are 
known. The A6 -desaturase gene can be introduced into 
plants by a leaf disk transformation- regeneration 
5 procedure as described by Horsch (1985) Science 

227 . 1229. Other methods of trans format ion , such as 
protoplast culture (Horsch ^ (1984) Science 223 . 

496; DeBlock ^ (1984) EMBO J. 2143; Barton ^ 

(1983) £^11 22/ 1033) can also be used and are 
10 within the scope of this invention. In a preferred 
embodiment plants are transformed with Agrobacterium - 
derived vectors such as those described in Klett et al. 
(1987) Annu. Rev. Plant Physiol. 2a:467, However, other 
methods are available to insert the A6 -desaturase genes 
15 of the present invention into plant cells. Such 

alternative methods include biolistic approaches (Klein 
^ (1987) liatiitfi 122, 70), electroporation, 

chemically- induced DNA uptake, and use of viruses or 
pollen as vectors. 
20 vnien necessary for the transformation method, 

the A6- desaturase genes of the present invention can be 
inserted into a plant transformation vector, e.g, the 
binary vector described by Bevan (1984) Nucleic Acids 
12/ 8111. Plant transformation vectors can be 
25 derived by modifying the natural gene transfer system of 
Aarobacteriim tumefaciens. The natural system comprises 
large Ti ( tumor- inducing) -plasmids containing a large 
segment, known as T-DNA, which is transferred to 
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transformed plants* Another segment of the Ti plasmid, 
^ the vir region, is responsible for T-DNA transfer. The 
T-DNA region is bordered by terminal repeats* In the 
modified binary vectors the tumor -inducing genes have 
been deleted and the functions of the vir region are 
5 utilized to transfer foreign DNA bordered by the T-DNA 
border sequences* The T- region also contains a 
selectable marker for antibiotic resistance, and a 
multiple cloning site for inserting sequences for 
transfer. Such engineered strains are known as 

10 "disarmed" A. tumefaciens strains, and allow the 

efficient transformation of sequences bordered by the T- 
region into the nuclear genomes of plants. 

Surf ace- sterilized leaf disks are inoculated 
with the "disarmed" foreign DNA- containing ^ 

-1-5 tumefaciens , cultured for two days, and then transferred 
to antibiotic -containing medium. Transformed shoots are 
selected after rooting in medium containing the 
appropriate antibiotic, transferred to soil and 
regenerated. 

20 Another aspect of the present invention 

provides transgenic plants or progeny of these plants 
containing the isolated DNA of the invention. Both 
monocotyledenous and dicotyledenous plants are 
contCTiplated. Plant cells are transformed with the 

25 isolated DNA encoding A6-desaturase by any of the plant 
transformation methods described eJDOve* The transformed 
plant cell, usually in a callus culture or leaf disk, is 
regenerated into a complete transgenic plant by methods 
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well- known to one of ordinary skill in the art (e,g, 
^ Horsch ^ slI. (1985) Science 222# 1129) - In a preferred 
embodiment, the transgenic plant is sunflower, oil seed 
rape, maize, tobacco, peanut or soybean. Since progeny 
of transformed plants inherit the DNA encoding A6- 
5 desaturase, seeds or cuttings from transformed plants 
are used to maintain the transgenic plant line. 

The present invention further provides a 
method for providing transgenic plants with an increased 
content of GLA. This method includes introducing DNA 
10 encoding A6 -desaturase into plant cells which lack or 
have low levels of GLA but contain LA, and regenerating 
plants with increased GLA content from the transgenic 
cells. In particular, commercially grown crop plants 
are contemplated as the transgenic organism, including, 
15 but not limited to, sunflower, soybean, oil seed rape, 
maize, peanut and tobacco* 

The present invention further provides a 
method for providing transgenic organisms which contain 
GLA. This method comprises introducing DNA encoding A6- 
20 desaturase into an organism which lacks or has low 

levels of GLA, but contains LA, In another embodiment, 
the method comprises introducing one or more expression 
vectors which comprise DNA encoding A12- desaturase and 
A6- desaturase into organisms which are deficient in both 
25 GLA and LA, Accordingly, organisms deficient in both LA 
and GLA are induced to produce LA by the expression of 
a12- desaturase, and GLA is then generated due to the 
expression of AS -desaturase, Es^ression vectors 
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comprising DNA encoding Al2-desaturase, or A12- 
^ desaturase and AS-desaturase, can be constructed by 
methods of recombinant technology known to one of 
ordinary skill in the art (Sambrook ^ 32.^, 1989) and 
the published sequence of A12 -desaturase (Wada £t sJL 
5 [1990] Nftturg {hQn<iQn) Ml, 200-203. m addition, it 
has been discovered in accordance with the present 
invention that nucleotides 2002-3081 of SEQ. ID N0:1 
encode cyanobacterial A12 -desaturase. Accordingly, this 
sequence can be used to construct the subject expression 
10 vectors. In particular, commercially grown crop plants 
are contemplated as the transgenic organism, including, 
but not limited to, sunflower, soybean, oil seed rape, 
maize, peanut and tobacco. 

The present invention is further directed to a 
15 method of inducing chilling tolerance in plants. 

Chilling sensitivity may be due to phase transition of 
lipids in cell membranes. Phase transition temperature 
depends upon the degree of unsaturation of fatty acids 
in membrane lipids, and thus increasing the degree of 
20 unsaturation, for example by introducing A6- desaturase 
to convert LA to GLA, can induce or improve chilling 
resistance. Accordingly, the present method comprises 
introducing DNA encoding A6- desaturase into a plant 
cell, and regenerating a plant with improved chilling 
25 resistance from said transformed plant cell. In a 

preferred embodiment, the plant is a sunflower, soybesin, 
oil seed rape, maize, peanut or tobacco plant. 
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The following examples further illustrate the 
1 present invention. 
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BZAMPLE 1 

1 Strains and Culture Conditions 

SvnerihQCvstis (PCC 6803, ATCC 27184), Anabaena 
(PCC 7120, ATCC 27893) and RvneGhococcus (PCC 7942, ATCC 
5 33912) were grown photoautotrophically at 30°C in BG11N+ 
medium (Rippka £t ai- [1979] J. Gen. Mlnrobiol. m, 1- 
61) under illumination of incandescent lamps 
(60uE,ra"^.S'^) . Cosmids and plasmids were selected and 
propagated in Escherichia coli strain DH5a on LB medium 
10 supplemented with antibiotics at standard concentrations 
as described by Mania tis st ai- (1982) Molecular 
Cloning ; Lat^oratory Manual. Cold Spring Harbor 
Laboratory, Cold Spring, New York. 

15 
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BXAMPLE 2 

Construct ion of Synechocystis Cosoiid Genomic Library 

Total genomic DNA from S vnechocvs t i s (PCC 
6803) was partially digested with fiaii3A and fractionated 
5 on a sucrose gradient (Ausiibel ^ al- [19871 Current 
ProtOgPlig in MQlgCUlar ^iQlQ<n, Greene Publishing 
Associates and Wiley Intersciencer New York) . Fractions 
containing 30 to 40 kb DNA fragments were selected and 
ligated into the dephosphorylated BafllHI site of the 

10 cosmid vector, pDUCA7 {Buikema ^ al- [1991] J, 
Bacteriol > 173 . 1879-1885). The ligated DNA was 
packaged in vitro as described by Ausubel ^ (1987) , 

and packaged phage were propagated in E. coli DH5a 
containing the Ava l and E5sq4711 methylase helper 

15 plasmid, pRL528 as described by Buikema ^ (1991) . 

A total of 1152 colonies were isolated randomly and 
maintained individually in twelve 96 -well microtiter 
plates • 

20 
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EXAMPLS 3 

^ Gain -o£- Function E:q;>ression of 6L& in Anflfrg^H^ 

Anabaena (PCC 7120), a filamentous 
cyanobacterium, is deficient in GLA but contains 
5 significant amounts of linoleic acid, the precursor for 
GLA (Figure 2; Table 2). The Svnechocvstis cosmid 
library described in Example 2 was conjugated into 
Anabaena (PCC 7120) to identify transconjugcuits that 
produce GLA, Anabaena cells were grown to mid- log phase 
10 in BG11N+ liquid medium and resuspended in the same 
medium to a final concentration of approximately 2x10® 
cells per ml. A mid- log phase culture of E. coli RP4 
(Burkardt ^ [1979] J. Gen, Microbiol. Hi, 341-348) 

grown in LB containing ampicillin was washed and 
15 resuspended in fresh LB medium, Anabaena and RP4 were 
then mixed and spread evenly on BG11N+ plates containing 
5% LB. The cosmid genomic library was replica plated 
onto LB plates containing 50 ug/ml kanamycin and 17.5 
ug/ml chloramphenicol and was siibsequently patched onto 
20 BG11N+ plates containing Anabaena and RP4, After 24 
hours of inciibation at BO^'C, 30 ug/ml of neomycin was 
underlaid; and incubation at 30°C was continued until 
transconjugants appeared. 

Individual transconjugants were isolated after 
25 conjugation and grown in 2 ml BG11N+ liquid medium with 
15 ug/ml neomycin. Fatty acid methyl esters were 
prepared from wild type cultures and cultures containing 
pools of ten transconjugants as follows. Wild type and 
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transgenic cyanobacterial cultures were harvested by 
1 cent rifugat ion and washed twice with distilled water. 
Fatty acid methyl esters were extracted from these 
cultures as described by Dahmer ^ (1989) tJ. Amer. 

Oil, Chem, Soc. 66 , 543-548 and were analyzed by Gas 
5 Liquid Chromatography (GLC) using a Tracer- 560 equipped 
with a hydrogen flame ionization detector and capillary 
column {30 m x 0*25 mm bonded FSOT Superox II, Alltech 
Associates Inc., IL) . Retention times and co- 
chromatography of standards (obtained from Sigma 
10 Chemical Co,) were used for identification of fatty 
acids. The average fatty acid composition was 
determined as the ratio of peak area of each C18 fatty 
acid normalized to an internal standard. 

Representative GLC profiles are shown in Fig. 

15 2. C18 fatty acid methyl esters are shown. Peaks were 
identified by comparing the elution times with known 
standards of fatty acid methyl esters and were confirmed 
by gas chromatography -mass spectrometry. Panel A 
depicts GLC analysis of fatty acids of wild type 

20 Anabaena . The arrow indicates the migration time of 
GLA. Panel B is a GLC profile of fatty acids of 
transconjugants of Anaba^r^^ with pAM542+1.8F. Two GLA 
producing pools (of 25 pools representing 250 
transconjugeints) were identified that produced GLA. 

25 Individual transconjugants of each GLA positive pool 
were analyzed for GLA production; two independent 
transconjugants, AS13 and AS75, one from each pool, were 
identified which expressed significant levels of GLA and 
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which contained cosmids, cSyl3 and cSy75, respectively 
1 (Figure 3) . The cosmids overlap in a region 

approximately 7.5 kb in length. A 3.5 kb Nhe l fragment 
of cSy75 was recloned in the vector pDUCA7 and 
transferred to Anabaena resulting in gain -of -function 
5 expression of GLA (Table 2) . 

Two Mlfil/Hind III subfragments (1.8 and 1.7 
kb) of the 3.5 kb Nhe I fragment of cSy75-3.5 were 
subcloned into "pBLUESCRIPT" (Stratagene) (Figure 3) for 
sequencing. Standard molecular biology techniques were 
10 performed as described by Mania t is ^ (1982) and 

Ausubel ^ (1987) . Dideoxy sequencing (Sanger ^ al ■ 

[1977] PrQC. Natlt Acadt Sgir USA 21 > 5463-5467) of 
pBSl.8 was performed with "SEQUENASE" (United States 
Biochemical) on both strands by using specific 

15 oligonucleotide primers synthesized by the Advanced DNA 
Technologies Laboratory (Biology Department, Texas A & M 
University) . DNA sequence analysis was done with the 
GCG (Madison, WI) software as described by Devereux ^ 
al. (1984) NUCl^jg Acidg fieg, 12. 387-395, 

20 Both Hbtfel/flindlll STibf ragmen ts were 

transferred into a conjugal expression vector, AM542, in 
both forward and reverse orientations with respect to a 
cyanobacterial carboxylase promoter and were introduced 
into An^];>^f?nfi! by conjugation. Transconjugants 

25 containing the 1.8 kb fragment in the foirward 

orientation (AM542-1.8F) produced significant quantities 
of GLA and octadecatetraenoic acid (Figure 2; Table 2) . 
Transconjugants containing other constructs, either 
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reverse oriented 1.8 kb fragment or forward and reverse 
1 oriented 1.7 kb fragment, did not produce detectable 
levels of GLA (Table 2) . 

Figure 2 compares the C18 fatty acid profile 
of an extract from wild type Anabaena (Figure 2A) with 
5 that of transgenic ftnf^t>i=^erni^ containing the 1.8 kb 

fragment of cSy75-3.5 in the forward orientation (Figure 
2B) . GLC analysis of fatty acid methyl esters from 
AM542-1.8F revealed a peak with a retention time 
identical to that of authentic GLA standard. Analysis 

10 of this peak by gas chromatography -mass spectrometry 
(GC-MS) confirmed that it had the same mass 
fragmentation pattern as a GLA reference sample. 
Transgenic Anafe^^^n^ with altered levels of 
polyunsaturated fatty acids were similar to wild type in 

15 growth rate and morphology. 
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EXAMPLE 4 

1 Transformation o£ Synechococcug 

with a6 and a12 Desatura^se Genes 

A third cosmid, cSy7, which contains a a12- 
5 desaturase gene, was isolated by screening the 

Synechocvs 1 1 s genomic library with a oligonucleotide 
synthesized from the published Synechocvs t i s a12- 
desaturase gene sequence (Wada £t [1990] Nature 

(London) 347 , 200-203). A 1.7 kb AysI fragment from 

20 this cosmid containing the a12 -desaturase gene was 

identified and used as a probe to demonstrate that cSyl3 
not only contains a A6 -desaturase gene but also a a12- 
desaturase gene (Figure 3) . Genomic Southern blot 
analysis further showed that both the A6-and a12- 

2^ desaturase genes are unique in the Synechocvs t i s genome 
so that both functional genes involved in C18 fatty acid 
desaturation are linked closely in the S vnechocy s t i ^ 
genome . 

The unicellular cyanobacterium Svnechococcus 
2Q (PCC 7942) is deficient in both linoleic acid and 
GLA(3). The a12 and A6- desaturase genes were cloned 
individually and together into pAM854 (Bustos ^ al , 
[1991] J. Bacterid , MA. 7525-7533), a shuttle vector 
that contains sequences necessary for the integration of 
2^ foreign DNA into the genome of Synechococcu^ (Golden jat 
ai. [1987] Methods in Enzvmol . 153. 215-231). 
Svnechococcus was transformed with these gene constructs 
and colonies were selected. Fatty acid methyl esters 
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were extracted from transgenic Svnechococcus and 
^ analyzed by GLC, 

Table 2 shows that the principal fatty acids 
of wild type Svnechococcus are stearic acid (18:0) and 
oleic acid (18:1) . Svnechococcus transformed with 
5 pAM854-Al2 expressed linoleic acid (18:2) in addition to 
the principal fatty acids* Transformants with pAM854-A6 
and a12 produced both linoleate and GliA (Table 1) . 
These results indicated that Synechococcus containing 
both a12- and A6 -desaturase genes had gained the 

10 capability of introducing a second double bond at the 
a12 position and a third double bond at the A6 position 
of C18 fatty acids. However, no changes in fatty acid 
composition was observed in the transformant containing 
pAM854-A6, indicating that in the absence of substrate 

15 synthesized by the A12 desaturase, the A6-desaturase is 
inactive. This experiment further confirms that the 1,8 
kb Uhfil/Hifldlll fragment (Figure 3) contains both coding 
and promoter regions of the Svnechocvstis A6-desaturase 
gene. Transgenic S ynechococcus with altered levels of 
20 polyunsaturated fatty acids were bimilar to wild type in 
growth rate and morphology. 
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EXAMPLB 5 

^ Nucleotide Sequence o£ A6-Desaturase 

The nucleotide sequence of the 1-8 kb fragment 
of cSy75-3»5 including the functional A6-desaturase gene 
5 was determined. An open reading frame encoding a 
polypeptide of 359 amino acids was identified (Figure 
4) . A Kyte-Doolittle hydropathy analysis (Kyte ^ 
[1982] J. Mol, Biol, 157 , 105-132) identified two 
regions of hydrophobic amino acids that could represent 

10 transmembrane domains (Figure lA) ; furthermore, the 
hydropathic profile of the A6-desaturase is similar to 
that of the Al2-desaturase gene (Figure IB; Wada ^ al , ) 
and A9-desaturases (Thiede ^ al- [1986] J, Biol, Chem. 
2£1, 13230-13235), However, the sequence similarity 

15 between the Synechocystis A6- and A12 -desaturases is 

less than 40% at the nucleotide level and approximately 
18% at the amino acid level. 

20 
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BXAMPLB 6 

1 Transfer of Cyanobacterial A^-Desaturase into Tobacco 

The cyanobacterial A^-desaturase gene was 
mobilized into a plant expression vector and transferred 
5 to tobacco using Aarobacterinin mediated gene transfer 
techniques. To ensure that the transferred desaturase 
is appropriately expressed in leaves and developing 
seeds and that the desaturase gene product is targeted 
to the endoplasmic reticultim or the chloroplast, various 

10 expression cassettes with s ynechocys t i s a -desaturase 
open reading frame (ORF) were constructed. Components 
of these cassettes include: (i) a 35S promoter or seed 
specific promoter derived from the sunflower 
helianthinin gene to drive A6 -desaturase gene expression 

15 in all plant tissues or only in developing seeds 

respectively, (ii) a putative signal peptide either from 
carrot extension gene or sunflower helianthinin gene to 
target newly synthesized A6- desaturase into the ER, 
(iii) an ER lumen retention signal sequence (KDEL) at 

20 the COOH- terminal of the A6- desaturase ORF, and (iv) an 
optimized transit peptide to target A6 desaturase into 
the chloroplast. The 35S promoter is a derivative of 
pRTL2 described by Restrepo ^ (1990) , The 

optimized transit peptide sequence is described by Van 

25 de Broeck £t al. (1985) . The carrot extensin signal 
peptide is described by Chen ^ (1985) EMBO J. jg., 
2145. 
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Transgenic tobacco plants were produced 
^ containing a chimeric cyanobacterial desaturase gene, 
comprised of the Svnechocystis A6- desaturase gene fused 
to an endoplasmic reticulum retention sequence (KDEL) 
and extensin signal peptide driven by the CaMV 35S 
5 promoter • PGR amplifications of transgenic tobacco 
genomic DNA indicate that the A6- desaturase gene was 
incorporated into the tobacco genome. Fatty acid methyl 
esters of leaves of these transgenic tobacco plants were 
extracted and analyzed by Gas Liquid Chromatography 
10 (GLC) . These transgenic tobacco accumulated significant 
amounts of GLA {Figure 4) . Figure 4 shows fatty acid 
methyl esters as determined by GLC. Peaks were 
identified by comparing the elution times with known 
standards of fatty acid methyl ester. Accordingly, 
15 cyanobacterial genes involved in fatty acid metabolism 
can be used to generate transgenic plants with altered 
fatty acid compositions. 
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EXAMPLE 7 
Construction of Borage cDMA librairy 

Membrane bound polysomes were isolated from 
borage seeds 12 days post pollination (12 DPP) using the 
protocol established for peas by Larkins and Davies 
(1975 Plant Phys . 55;749"756). RNA was extracted from 
the polysomes as described by Mechler (1987 Methods in 
Enzymology 152 ;241-248. Academic Press) . 

Poly-A+ RNA was isolated from the membrane 
bound polysomal RNA by use of Oligotex-dT beads 
(Qiagen) , Corresponding cDNA was made using 
Stratagene's ZAP cDNA synthesis kit. The cDNA library 
was constructed in the lambda ZAP II vector (Stratagene) 
using the lambda ZAP II vector kit. The primary library 
was packaged in Gigapack II Gold packaging extract 
(Stratagene) . The library was used to generate 
expressed sequence tags (ESTs) , and sequences 
corresponding to the tags were used to scan the GenBank 
database. 
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EXAMPLE 8 
Hybridization Protocol 

Hybridization probes for screening the borage 
cDNA library were generated by using random primed DNA 
synthesis as described by Ausubel sJt ^ (1994 Current 
PrQtocolg in MQlegul^r Biolpgy/ wiley interscience, 
N.Y.) and corresponded to previously identified 
abundantly expressed seed storage protein cDNAs. 
Unincorporated nucleotides were removed by use of a G-50 
spin column {Boehringer Manheim) , Probe was denatured 
for hybridization by boiling in a water bath for 5 
minutes, then quickly cooled on ice. Filters for 
hybridization were prehybridized at 60°C for 2-4 hours 
in prehybridization solution {6XSSC [Maniatis et al 1984 
Molecular Cloning A Laboratory Manual, Cold Spring 
Harbor Laboratory], IX Denharts Solution, 0.05% sodixim 
pyrophosphate, 100 iig/ml denatured salmon sperm DNA) . 
Denatured probe was added to the hybridization solution 
(6X SSC, IX Denharts solution, 0.05% sodium 
pyrophosphate, 100 ug/ml denatured salmon sperm DNA) and 
incubated at eo^'C with agitation overnight* Filters 
were washed in 4x, 2x, and Ix SET washes for 15 minutes 
each at GO^'C. A 2 OX SET stock solution is 3M NaCl, 0.4 
M Tris base, 20 mM Na2EDTA-2H20. The 4X SET wash was 4X 
SET, 12.5 mM PO^, pH 6.8 and 0.2% SDS. The 2X SET wash 
was 2X SET, 12.5 mM PO^, pH 6.8 and 0.2% SDS. The IX SET 
wash was IX SET, 12.5 mM PO^, pH 6.8 and 0.2% SDS. 
Filters were allowed to air dry and were then ej^osed to 
X-ray film for 24 hours with intensifying screens at - 
SO^C. 
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EZAMPLE 9 

Random sequencing of cDNAs from a borage seed 
(12 DPP) membrane -bound polysomal library 

1 

The borage cDNA library was plated at low 
density (500 pfu on 150 ram petri dishes) . Highly 
prevalent seed storage protein cDNAs were "subtracted" 
by screening with the previously identified 
corresponding cDNAs. Non- hybridizing plaques were 
excised using Stratagene's excision protocol and 
reagents. Resulting bacterial colonies were used to 
inoculate liquid cultures and were either sequenced 
"^^ manually or by an ABI automated sequencer. Each cDNA 
was sequenced once and a sequence tag generated from 
200-300 base pairs. All sequencing was performed by 
cycle sequencing (Epicentre) • Over 300 ESTs were 
generated. Each sequence tag was compared to GenBank 
database by BLASTX computer program and a number of 
lipid metabolism genes, including the A6 -desaturase were 
identified. 

Database searches with a cDNA clone designated 
mbp-65 using BLASTX with the GenBank database resulted 

20 in a significant match to the Svnechocystis A6- 

desaturase. It was determined however, that this clone 
was not a full length cDNA. A full length cDNA was 
isolated using mbp-65 to screen the borage membrane- 
bound polysomal library. The sequence of the isolated 

25 CDNA was determined (Fig. 5A, SEQ ID NO: 4) and the 

protein sequence of the open reading frame (Fig. 5B, SEQ 
ID NO: 5) was compared to other known desaturases using 
Geneworks (IntelligGenetics) protein alignment program 

30 
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(Fig. 2) ♦ This alignment indicated that the cDNA was 
the borage A6-desaturase gene. 

Although similar to other known plant 
desaturases, the borage delta 6-desaturase is distinct 
as indicated in the dendrogram shown in Fig. 6* 
Furthermore, comparison of the amino acid sequences 
characteristic of desaturases, particularly those 
proposed to be involved in metal binding (metal box 1 
and metal box 2) , illustrates the differences between 
the borage delta 6-desaturase and other plant 
desaturases (Table 3) . 

The borage delta 6-desaturase is distinguished 
from the cyanobacterial form not only in over all 
sequence (Fig. 6) but also in the lipid box, metal box 1 
and metal box 2 amino acid motifs (Table 3) • As Table 3 
indicates, all three motifs are novel in sequence. Only 
the borage delta 6-desaturase metal box 2 showed some 
relationship to the Synechocystr^a delta -6 desaturase 
metal box 2. 

In addition, the borage delta 6-desaturase is 
also distinct from another borage desaturase gene, the 
delta- 12 desaturase. PI -81 is a full length cDNA that 
was identified by EST analysis and shows high similarity 
to the Arabidonsis delta-12 desaturase (Fad 2) . A 
comparison of the lipid box, metal box 1 and metal box 2 
amino acid motifs (Table 3) in borage delta 6 and delta- 
12 desaturases indicates that little homology exists in 
these regions. The placement of the two sequences in 
the dendrogram in Fig. 6 indicates how distantly related 
these two genes are. 
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EXAMPLE 10 

Construction of 222.1A^S for transient 
and expression 

The vector pBI221 (Jefferson et al, 1987 
EMBO J, £: 3901 -3907) was prepared for ligation by 
digestion with BamHI and EcoICR I (Promega) which 
excises the GUS coding region leaving the 35S promoter 
and NOS terminator intact. The borage A6-desaturase 
cDNA was excised from the Bluescript plasmid 
(Stratagene) by digestion with BamHI and Xhol. The 
Xhol end was made blunt by use of the Klenow fragment. 
This fragment was then cloned into the BamHI/EcoICR I 
sites of PBI221, yielding 221.1A^NOS (Fig. 7). In 
221, lA^. NOS, the remaining portion (backbone) of the 
restriction map depicted in Fig. 7 is pBI221. 
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EXAMPLE 11 

Construction of 121.lA^«NOS for stable transformation 

The vector pBI121 (Jefferson et al. 1987 
EMBO J, £: 3901 -3907) was prepared for ligation by 
digestion with BamHI and EcoICR I (Promega) which 
excises the GUS coding region leaving the 35S promoter 
and NOS terminator intact. The borage A6-desaturase 
cDNA was excised from the Bluescript plasmid 
(Stratagene) by digestion with BamHI and Xhol. The 
Xhol end was made blunt by use of the Klenow fragment. 
This fragment was then cloned into the BamHI /EcoICR I 
sites of PBI121, yielding 121.1A^N0S (Fig. 7). In 
121. A^. NOS, the remaining portion (backbone) of the 
restriction map depicted in Fig. 7 is pBI121. 
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SXAMPLS 12 
Transient Bs^ression 

All work involving protoplasts was performed 
in a sterile hood. One ml of packed carrot suspension 
cells were digested in 30 mis plasmolyzing solution 
(25 g/1 KCl, 3.5 g/1 CaCl2-H20, lOmM MES, pH 5.6 and 
0.2 M mannitol) with 1% cellulase, 0.1% pectolyase, 
and 0.1% dreisalase overnight, in the dark, at room 
temperature. Released protoplasts were filtered 
through a 150 ]im mesh and pelleted by centrif ugation 
(lOOx g, 5 min.) then washed twice in plasmolyzing 
solution. Protoplasts were counted using a double 
chambered hemocytometer . DNA was trans fected into the 
protoplasts by PEG treatment as described by Nunberg 

and Thomas (1993 Methods in Plant MQl^cular Biology 

and Bio technology . B.R. Glick and J.E. Thompson, eds, 
pp. 241-248) using 10^ protoplasts and 50-70 ug of 
plasmid DNA (221. A6 .NOS) . Protoplasts were cultured 
in 5 mis of MS media supplemented with 0.2M mannitol 
and 3 pm 2,4-D for 48 hours in the dark with shaking. 
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EXAMPLE 13 
Stable trans fonaat ion of tobacco 

121,1A^N0S plasmid construction was used to 
transform tobacco (Nicotiana tabacum cv, xanthi) via 
jyarobapteriuya according to standard procedures (Horsh 
et al,, 1985 Science 227 : 1229-1231; Bogue et al,, 
1990 Mol. Gen. Genet. 221:49-57), except that initial 
trans formants were selected on 100 ug/ml kanainycin. 
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EXAMPLE 14 

Preparation and analysis o£ 
fatty acid methyl esters (FAMBs) 

1 

Tissue from transfected protoplasts and 
transformed tobacco plants was frozen in liquid 
nitrogen and lyophilized overnight. FAMEs were 
5 prepared as described by Dahmer et al (1989 J. Amer. 
Oil Chem, Soc. 543 - 548) . In some cases, the 
solvent was evaporated again, and the FAMEs were 
resuspended in ethyl acetate and extracted once with 
deionized water to remove any water soluble 

10 contaminants. The FAMEs were analyzed by gas 

chromatography (GC) on a J&W Scientific DB-wax column 
(30 m length, 0.25 mm ID, 0.25 um film) . 

An example of a transient assay is shown in 
Fig. 8 which represents three independent 

15 transfections pooled together. The addition of the 
borage A6-desaturase cDNA corresponds with the 
appearance of gamma linolenic acid (GLA) which is one 
of the possible products of A6 - desaturase . 
Furthermore, transgenic tobacco containing the borage 

20 A6 - desaturase driven by the cauliflower mosaic virus 
35S promoter also produce GLA as well as octa- 
decaenoic acid (18:4) which is formed by the further 
desaturation of GLA (Fig. 9) . These results indicate 
that the borage delta 6 -desaturase gene can be used to 

25 transform plant cells to achieve altered fatty acid 
compositions . 
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EXAMPLE 15 

Isolation of an Evening Primrose A6*desaturase gene 

Total RNA was isolated from evening primrose 
embryos using the method of Chang, Puryear, and 
Cairney (1993) Plant Mol Biol Reporter 11:113-116. 
Poly a"" RNA was selected on oligotex beads (Qiagen) and 
used as a template for cDNA synthesis. The cDNA 
library was constructed in the lambda ZAP II vector 
(Stratagene) using the lambda ZAP II vector kit. The 
primary library was packaged with Gigapack II Gold 
packaging extract (Stratagene) . 

PGR primers based on sequences in the borage 
A6 -desaturase gene were synthesized by a commercial 
source using standard protocols and included the 
following oligonucleotides: 

5^ AAACCAATCCATCCAAGRA 3* SEQ ID NO: 27 

5* KTGGTGGAAATGGAMSCATAA 3' SEQ ID NO: 28 
(R=A and G, K=G and T, M=A and C, S=G and C) 

A primer that matches a region that flanks 
the insertion site of the lambda ZAP II vector was 
also synthesized using an ABI394 DNA synthesizer and 
standard protocols. This primer ahd the following 
sequence : 

5' TCTAGAACTAGTGGATC 3' SEQ ID NO: 29 

An aliquot of the cDNA library was used 
directly as template in a PGR reaction using SEQ ID 
NO: 27 and SEQ ID NO: 29 as primers. The reactions 
were carried out in a volume of 50 ul using an 
annealing temperature of SO^'C for 2 minutes, an 
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extension temperature of 72*'C for 1.5 minutes, and a 
melting temperature of 94 ""C for 1 minute for 29 
cycles. A final cycle with a 2 minute annealing at 
50 °C and a 5 minute extension at 72 °C completed the 
reaction. One ul from this reaction was used as a 
template in a second reaction using the same 
conditions except that the primers were SEQ ID NO: 27 
and SEQ ID NO: 28. A DNA fragment of predicted size 
based on the location of the primer sequences in the 
the borage A6 -desaturase cDNA was isolated. 

This PGR fragment was cloned into pT7 Blue 
(Novagen) and used to screen the evening primrose cDNA 
library at low stringency conditions: The 
hybridization buffer used was 1% bovine serum albumin 
(crystalline fraction V), ImM EDTA, 0.5 M NaHP04pH7.2, 
and 7% SDS. The hybridizations were at 65''C. The 
wash buffer was ImM NaaEDTA, 40 mM NaHP04pH7.2 and 1% 
SDS. Primary screens were washed at 25°C. Secondary 
and tertiary screens were washed at 25'*C, 37''C, and 
42°C. One of the positively hybridizing clones that 
was identified in the evening primrose cDNA library 
was excised as a phagemid in pBluescript. The DNA 
sequence of the 1687 bp insert of this phagemid 
(PIB9748-4) was determined (Fig. 10, SEQ ID NO: 26) 
using the ABIPRISM^ dye terminator cycle sequencing 
core kit from Perkin Elmer according to the 
manufacturer's protocol. The sequence encodes a full 
length protein of 450 amino acids (SEQ ID NO:27) with 
a molecular weight of 51492 daltons. 

Alignment of the deduced amino acid sequence 
with that of borage A6- desaturase was performed using 
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the Geneworks program (Fig. 11) . The evening primrose 
A6-desaturase protein is identical at 58% of the 
residues and similar at an additional 20% of the 
residues. Only two small gaps, near the carboxy 
terminal end of the protein were introduced by the 
program to obtain the alignment (Fig. 11) . The two 
proteins were compared using two different alogorithms 
that measure the hydrophobicity of regions to the 
protein. Figures 12A and 12B are Kyte-Doolittle 
hydrophobicity plots of borage A6-desaturase and 
evening primrose A6 -desaturase, respectively. 
Figures 13A and 13B are Hopwood hydrophobicity plots 
generated in the program DNA Strider for the same 
proteins. A discussion of the algorithim used to 
generate these plots can be found in Hopp, T.P. and 
Woods, K.R. 1983 Molecular Immunolocfv 2iL:483'89. 
Substantial similarity exists between the borage and 
evening primrose proteins using either algorithm. 
TMPredict, a program that predicts the location of 
transmembrane regions of proteins was run on the two 
sequences and again similar results were obtained 
(Figures 14 and 15) • Several weights matrices are used 
in scoring the predictions as reported in Hofmann, K. 
and Stoffel, W. 1993 Biol. C. Hoppe-Scyler 347:156. 
Positive values (x-axis) greater than 500 are 
considered significant predictors of a membrane 
spanning region; the x-axis represents the linear 
amino acid sequences. 

The membrane bound desaturases of plants 
possess three histidine rich motif fs (HRMs) . These 
motiffs are identified in the evening primrose 
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sequence and are indicated in Figure 10 by underlined 
plain text. The motifs in this sequence were 
identical to those found in borage A6-desaturase with 
the exception of those that are italicized (S 161 and 
L374) . The borage A6-desaturase is unique among known 
membrane bound desaturases in having a cytochrome b5 
domain at the carboxy terminal end. The evening 
primrose protein encoded by pIB9748-4 also has this 
domain. The heme binding motif f of chtochrome b5 
proteins is indicated in Figure 10 by underlined bold 
text . 

These data indicate that a A6-desaturase 
cDNA from evening primrose has been isolated and 
charac teri zed . 
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BXAMPLE 16 

Construction of expression vectors for transient and 
stable expression of an evening primrose A6-desatnrase 

The evening primrose A6 -desaturase cDNA is 
excised from the Bluescript phagemid by digestion with 
Xba I and Xho I* The entire cDNA sequence including 
the 5 ' transcribed but untranslated region depicted in 
Figure 10 (SEQ ID NO: 26) is operably linked to any one 
of various promoters and/or other regulatory elements 
in an expression vector, in order to effect 
transcription and translation of the A6 -desaturase 
gene. Alternatively, the cDNA sequence depicted in 
Figure 10 may be trimmed at the 5 ' end so that the 5 ' 
transcribed but untranslated sequence is removed. The 
A of the ATG translational start codon is then made 
the first nucleotide following the promoter and/or 
other regulatory sequence in an expression vector. 
In order to express the subject evening 
primrose cDNA in pBI221 (Jefferson et al. 1987 EMBO J, 

3901 -3907) the following manipulations are 
performed: 

The plasmid pBI221 is digested with EcoICR I 
(Promega) or Eel 136 II (NEB) and Xba I which excises 
the GUS coding region and leaves the 35S promoter and 
NOS terminator intact. The evening primrose A6- 
desaturase cDNA is excised from pIB9748-4 by digestion 
with Xba I and Xho I. The Xho I end is made blunt by 
use of the Klenow fragment. The excised gene is then 
cloned into the cloned into the Xba I/Eco ICR I sites 
of pBI221. The resulting construct is then 
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transfected into carrot protoplasts. One ml of packed 
carrot suspension cells are digested in 30 ml of 
^ plasmolyzing solution {25 g/1 KCl 3.5 g/1 CaCls-HjO, 10 
znM MES, pH 5.6 and 0.2 M mannitol) with 1% cellulase 
0.1% pectolyase, and 0-1% dreisalase overnight, in the 
dark, at room temperature. Released protoplasts are 
^ filtered through a 150 lam mesh and pelleted by 

centrifugation (100 x g, 5 minutes) , then washed twice 
in plasmolyzing solution. Protoplasts are counted 
using a double chambered hemocy tometer . DNA is 
transfected into the protoplasts by PEG treatment as 
described by Nunberg and Thomas (1993 Methods in Plant 
Molecular Biology and Bio technology. B.R. Glick and 
J.E. Thompson, eds. pp 241-248) using 10^ protoplasts 
and 50-70 ug of DNA from the above construct. 
Protoplasts are cultured in 5 ml of MS medium 
supplemented with 0.2 M mannitol and 3 ]M 2, 4-D for 
48 hours in the dark with shaking. Tobacco is 
transformed with the same A6-desaturase expression 
construct by following the method of Example 13. 

In order to express the subject evening 
primrose cDNA in pBI121 (Jefferson et al. 1987 EMLBO 
J. 6^:3901-3907) , the following manipulations are 
performed: 

The plasmid pBI121 is digested with EcoICR I 
(Promega) or Eel 136 II (NEB) and Xba I which excises 
the GUS coding region and leaves the 35 S promoter and 
NOS terminator intact. The evening primrose A6- 
desaturase cDNA is excised from pIB9748-4 by digestion 
with Xba I and Xho I. The Xho I end is made blunt by 
use of the Klenow fagment. The excised gene is then 
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cloned into the Xba I/Eco ICR I sites of pBI121, The 
resulting construct is used to transform Arabidopsis 
thaliana via Agrobacterium according to standard 
protocols (Bechtold N., Ellis. J., and Pelletier, G 
1993 C.R. Acad Sci Paris 316:1194-1199). Carrot and 
tobacco are transformed as described above. 
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SEQUENCE LISTING 

1) GENERAL INFORMATION: 

(i) APPLICANT: Thomas, Terry L. 



(ii) TITLE OF INVENTION: PRODUCTION OF GAMMA LINOLENIC ACID BY A 

DELTA 6-DESATURASE 

(iii) NUMBER OF SEQUENCES: 27 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Scully, Scott, Murphy & Presser 

(B) STREET: 400 Garden City Plaza 

(C) CITY: Garden City 

(D) STATE: New York 

(E) COUNTRY: United States 

(F) ZIP: 11530 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS/MS -DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) PILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Presser, Leopold 

(B) REGISTRATION NUMBER: 19,827 

(C) REFERENCE/DOCKET NUMBER: 8383ZYXWVU 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (516) 742-4343 

(B) TELEFAX: (516) 742-4366 

(C) TELEX: 230 901 SANS UR 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3588 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2002*. 3081 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 



GCTAGCCACC 


AGTGACGATG 


CCTTGAATTT 


GGCCATTCTG 


ACCCAGGCCC 


GTATTCTGAA 


60 


S^CCCGCATT 


CGCATTGTTA 


ATCGTTTGTT 


CAACCATGCC 


CTGGGTAAAC 


GTTTAGACAC 


120 


J&ACCTTGCCA 


GACCACGTTA 


GTTTGAGTGT 


TTCCGCCCTG 


GCGGCCCCGA 


TTTTTTCCTT 


180 


#GCGGCTTTG 


GGCAATCAGG 


CGATCGGGCA 


ATTGCGTTTG 


TTTGACCAGA 


CTTGGCCCAT 


240 


itCAGGAAATT 


GTCATTCACC 


AAGACCATCC 


CTGGCTCAAT 


TTACCCCTGG 


CGGATTTATG 


300 


iGGATGATCCG 


AGCCGAATGT 


TGATCTATTA 


CCTACCGGCC 


CACAGTGAAA 


CGGATTTAGT 


360 


#GGCGCAGTG 


GTGAATAATT 


TAACGTTGCA 


ATCTGGGGAC 


CATTTAATAG 


TGGGACAAAA 


420 


:^ACCCCAACCC 


AAGACCAAAC 


GGCGATCGCC 


TTGGCGCAAA 


TTTTCCAAAC 


TGATTACCAA 


480 


JJCCTGCGGGAG 


TATCAGCGGT 


ATGTCCAACA 


GGTGATATGG 


GTGGTGTTGT 


TTTTATTGTT 


540 


GATGATTTTT 


CTGGCCACCT 


TCATCTACGT 


TTCCATTGAT 


CAACATATTG 


CCCCAGTGGA 


600 


CGCGTTGTAT 


TTTTCCGTGG 


GCATGATTAC 


CGGGGCCGGT 


GGCAAGGAAG 


AGGTGGCCGA 


660 


AAAGTCGCCC 


GATATCATCA 


AAGTATTCAC 


AGTGGTGATG 


ATGATCGCCG 


GGGCGGGGGT 


720 


GATTGGTATT 


TGTTATGCCC 


TACTGAATGA 


TTTCATCCTT 


GGCAGTCGCT 


TTAGTCAGTT 


780 


TTTGGATGCG 


GCCAAGTTAC 


CCGATCGCCA 


TCACATCATC 


ATTTGTGGGC 


TGGGGGGAGT 


840 


GAGCATGGCC 


ATTATTGAAG 


AGTTAATTCA 


CCAGGGCCAT 


GAAATTGTGG 


TAATCGAAAA 


900 
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GGATACAGAT 


AATCGTTTCT 


TGCATACGGC CCGCTCCCTG 


GGGGTGCCCG 


TAATTGTGGA 


960 


GGATGCCCGC 


CTAGAAAGAA 


CGTTGGCCTG CGCCAATATC 


AACCGAGCCG 


AAGCCATTGT 


1020 


GGTGGCCACC 


AGCGACGACA 


CCGTTAACTT GGAAATTGGC 


CTAACTGCCA 


AGGCGATCGC 


1080 


CCCTAGCCTG 


CCAGTGGTGT 


TGCGTTGCCA GGATGCCCAG 


TTTAGCCTGT 


CCCTGCAGGA 


1140 


AGTATTTGAA 


TTTGAAACGG 


TGCTTTGTCC GGCGGAATTG 


GCCACCTATT 


CCTTTGCGGC 


1200 


GGCGGCCCTG 


GGGGGCAAAA 


TTTTGGGCAA CGGCATGACC 


GATGATTTGC 


TGTGGGTAGC 


1260 


CCTAGCCACC 


TTAATCACTC 


CTAACCATCC CTTTGCCGAC 


CAATTGGTTA 


AAATTGCAGC 


1320 


CCAAAAGTCT 


GATTTCGTTC 


CCCTCTATCT AGAACGGGGT 


GGCAAAACCA 


TCCATAGCTG 


1380 


ibAATTATTG GGTACCCATC TCGACTCTGG AGACGTGTTG TATTTAACCA TGCCCGCCAC 


1440 


:¥GCCCTAGAG 


CAACTTTGGC 


GATCGCCCCG TGCCACTGCT 


GATCCTCTGG 


ACTCTTTTTT 


1500 


;#GTTTAGCAT 


GGGGGGATGG 


AACTCTTGAC TCGGCCCAAT 


GGTGATCAAG 


AAAGAACGCT 


1560 


I^TGTCTATGT 


TTAGTATTTT 


TAAGTTAACC AACAGCAGAG 


GATAACTTCC 


AAAAGAAATT 


1620 


..AAGCTCAAAA 


AGTAGCAAAA 


TAAGTTTAAT TCATAACTGA 


GTTTTACTGC 


TAAACAGCGG 


1680 


.i^-GCAAAAAAG 


TCAGATAAAA 


TAAAAGCTTC ACTTCGGTTT 


TATATTGTGA 


CCATGGTTCC 


1 '7An 

± / <kU 


'JCAGGCATCTG 


CTCTAGGGAG 


TTTTTCCGCT GCCTTTAGAG 


AGTATTTTCT 


CCAAGTCGGC 


1800 


; t'AACTCCCCC ATTTTTAGGC AAAATCATAT ACAGACTATC 


CCAATATTGC 


CAGAGCTTTG 


1860 


ATGACTCACT 


GTAGAAGGCA 


GACTAAAATT CTAGCAATGG 


ACTCCCAGTT 


GGAATAAATT 


1920 


TTTAGTCTCC 


CCCGGCGCTG 


GAGTTTTTTT GTAGTTAATG 


GCGGTATAAT 


GTGAAAGTTT 


1980 


TTTATCTATT 


TAAATTTATA 


A ATG CTA ACA GCG GAA 


AGA ATT AAA TTT ACC 


2031 



Met Leu Thr Ala Glu Arg lie Lys Phe Thr 
15 10 



GAG AAA CGG GGG TTT CGT CGG GTA CTA AAC CAA CGG GTG GAT GCG TAG 2079 
Gin Lys Arg Gly Phe Arg Arg Val Leu Asn Gin Arg Val Asp Ala Tyr 

15 20 25 
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TTT GCC GAG CAT GGC CTG ACC CAA AGG GAT AAT CCC TCC ATG TAT CTG 2127 
Phe Ala Glu His Gly Leu Thr Gin Arg Asp Asn Pro Ser Met Tyr Leu 
30 35 40 

AAA ACC CTG ATT ATT GTG CTC TGG TTG TTT TCC GCT TGG GCC TTT GTG 2175 
Lys Thr Leu lie lie Val Leu Trp Leu Phe Ser Ala Trp Ala Phe Val 
45 50 55 

CTT TTT GCT CCA GTT ATT TTT CCG GTG CGC CTA CTG GGT TGT ATG GTT 2223 
Leu Phe Ala Pro Val lie Phe Pro Val Arg Leu Leu Gly Cys Met Val 
60 65 70 

TTG GCG ATC GCC TTG GCG GCC TTT TCC TTC AAT GTC GGC CAC GAT GCC 2271 
Leu Ala lie Ala Leu Ala Ala Phe Ser Phe Asn 'Val Gly His Asp Ala 
75 80 85 90 

AAC CAC AAT GCC TAT TCC TCC AAT CCC CAC ATC AAC CGG GTT CTG GGC 2319 
^n His Asn Ala Tyr Ser Ser Asn Pro His lie Asn Arg Val Leu Gly 

95 100 105 

iATG ACC TAC GAT TTT GTC GGG TTA TCT AGT TTT CTT TGG CGC TAT CGC 2367 
:Met Thr Tyr Asp Phe Val Gly Leu Ser Ser Phe Leu Trp Arg Tyr Arg 

fU 110 115 120 

JgAC AAC TAT TTG CAC CAC ACC TAC ACC AAT ATT CTT GGC CAT GAC GTG 2415 
,His Asn Tyr Leu His His Thr Tyr Thr Asn lie Leu Gly His Asp Val 

ri 125 130 135 

^GAA ATC CAT GGA GAT GGC GCA GTA CGT ATG AGT CCT GAA CAA QAA CAT 2463 
Glu lie His Gly Asp Gly Ala Val Arg Met Ser Pro Glu Gin Glu His 

1^ 140 145 150 

'^TT GGT ATT TAT CGT TTC CAG CAA TTT TAT ATT TGG GGT TTA TAT CTT 2511 
Val Gly lie Tyr Arg Phe Gin Gin Phe Tyr lie Trp Gly Leu Tyr Leu 
155 160 165 170 

TTC ATT CCC TTT TAT TGG TTT CTC TAC GAT GTC TAC CTA GTG CTT AAT 2559 
Phe lie Pro Phe Tyr Trp Phe Leu Tyr Asp Val Tyr Leu Val Leu Asn 

175 180 185 

AAA GGC AAA TAT CAC GAC CAT AAA ATT CCT CCT TTC CAG CCC CTA GAA 2607 
Lys Gly Lys Tyr His Asp His Lys lie Pro Pro Phe Gin Pro Leu Glu 
190 195 200 

TTA GCT AGT TTG CTA GGG ATT AAG CTA TTA TGG CTC GGC TAC GTT TTC 2655 
Leu Ala Ser Leu Leu Gly lie Lys Leu Leu Trp Leu Gly Tyr Val Phe 
205 210 215 

GGC TTA CCT CTG GCT CTG GGC TTT TCC ATT CCT GAA GTA TTA ATT GGT 2703 
Gly Leu Pro Leu Ala Leu Gly Phe Ser lie Pro Glu Val Leu He Gly 
220 225 230 

GCT TCG GTA ACC TAT ATG ACC TAT GGC ATC GTG GTT TGC ACC ATC TTT 2751 
Ala Ser Val Thr Tyr Met Thr Tyr Gly He Val Val Cys Thr He Phe 
235 240 245 250 
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ATG CTG GCC CAT GTG TTG GAA TCA ACT GAA TTT CTC ACC CCC GAT GGT 2799 
Met Leu Ala His Val Leu Glu Ser Thr Glu Phe Leu Thr Pro Asp Gly 

255 260 265 

GAA TCC GGT GCC ATT GAT GAC GAG TGG GCT ATT TGC CAA ATT CGT ACC 2847 
Glu Ser Gly Ala lie Asp Asp Glu Trp Ala lie Cys Gin lie Arg Thr 
270 275 280 

ACG GCC AAT TTT GCC ACC AAT AAT CCC TTT TGG AAC TGG TTT TGT GGC 2895 
Thr Ala Asn Phe Ala Thr Asn Asn Pro Phe Trp Asn Trp Phe Cys Gly 
285 290 295 

GGT TTA AAT CAC CAA GTT ACC CAC CAT CTT TTC CCC AAT ATT TGT CAT 2943 
Gly Leu Asn His Gin Val Thr His His Leu Phe Pro Asn lie Cys His 
300 305 310 

ATT CAC TAT CCC CAA TTG GAA AAT ATT ATT AAG GAT GTT TGC CAA GAG 2991 
13.6 His Tyr Pro Gin Leu Glu Asn lie lie Lys Asp Val Cys Gin Glu 
ns 320 325 330 

^T GGT GTG GAA TAT AAA GTT TAT CCC ACC TTC AAA GCG GCG ATC GCC 3039 
Bie Gly Val Glu Tyr Lys Val Tyr Pro Thr Phe Lys Ala Ala lie Ala 
fU 335 340 345 

JS:T AAC TAT CGC TGG CTA GAG GCC ATG GGC AAA GCA TCG TGACATTGCC 3088 

Ser Asn Tyr Arg Trp Leu Glu Ala Met Gly Lys Ala Ser 

n 350 355 360 



3^GGGATTGA 


AGCAAAATGG 


CAAAATCCCT 


CGTAAATCTA 


TQATCGAAGC 


CTTTCTGTTG 


3148 


^CCGCCGACC 


AAATCCCCGA 


TGCTGACCAA 


AGGTTGATGT 


T3GCATTGCT 


CCAAACCCAC 


3208 


TTTGAGGGGG 


TTCATTGGCC 


GCAGTTTCAA 


GCTGACCTAG 


GAGGCAAAGA 


TTGGGTGATT 


3268 


TTGCTCAAAT 


CCGCTGGGAT 


ATTGAAAGGC 


TTCACCACCT 


TTGGTTTCTA 


CCCT6CTCAA 


3328 


TGGGAAGGAC 


AAACCGTCAG 


AATTGTTTAT 


TCTGGTGACA 


CCATCACCGA 


CCCATCCATG 


3388 


TGGTCTAACC 


CAGCCCTGGC 


CAAGGCTTGG 


ACCAAGGCCA 


TGCAAATTCT 


CCACGAGGCT 


3448 


AGGCCAGAAA 


AATTATATTG 


GCTCCTGATT 


TCTTCCGGCT 


ATCGCACCTA 


CCGATTTTTG 


3508 


AGCATTTTTG 


CCAAGGAATT 


vCTATCCCCAC 


TATCTCCATC 


CCACTCCCCC 


GCCTGTACAA 


3568 


AATTTTATCC 


ATCAGCTAGC 










3588 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Thr Ala Glu Arg lie Lys Phe Thr Gin Lys Arg Gly Phe Arg 
15 10 15 

Arg Val Leu Asn Gin Arg Val Asp Ala Tyr Phe Ala Glu His Gly Leu 
20 25 30 

Thr Gin Arg Asp Asn Pro Ser Met Tyr Leu Lys Thr Leu lie lie Val 
35 40 45 

Leu Tip Leu Phe Ser Ala Trp Ala Phe Val Leu Phe Ala Pro Val lie 
50 55 60 

Phe Pro Val Arg Leu Leu Gly Cys Met Val Leu Ala lie Ala Leu Ala 
35 70 75 80 

ala Phe Ser Phe Asn Val Gly His Asp Ala Asn His Asn Ala Tyr Ser 

85 90 95 

ii^fer Asn Pro His lie Asn Arg Val Leu Gly Met Thr Tyr Asp Phe Val 
100 105 110 

Gly Leu Ser Ser Phe Leu Trp Arg Tyr Arg His Asn Tyr Leu His His 
115 120 125 

Shr Tyr Thr Asn lie Leu Gly His Asp Val Glu lie His Gly Asp Gly 
130 135 140 

'klsi Val Arg Met Ser Pro Glu Gin Glu His Val Gly He Tyr Arg Phe 
1145 150 155 160 

Gin Gin Phe Tyr He Trp Gly Leu Tyr Leu Phe He Pro Phe Tyr Trp 

165 170 175 

Phe Leu Tyr Asp Val Tyr Leu Val Leu Asn Lys Gly Lys Tyr His Asp 
180 185 190 

His Lys lie Pro Pro Phe Gin Pro Leu Glu Leu Ala Ser Leu Leu Gly 
195 200 205 

lie Lys Leu Leu Trp Leu Gly Tyr Val Phe Gly Leu Pro Leu Ala Leu 
210 215 220 

Gly Phe Ser He Pro Glu Val Leu He Gly Ala Ser Val Thr Tyr Met 
225 230 235 240 

Thr Tyr Gly He Val Val Cys Thr He Phe Met Leu Ala His Val Leu 

245 250 255 

Glu Ser Thr Glu Phe Leu Thr Pro Asp Gly Glu Ser Gly Ala He Asp 
260 265 270 
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Asp Glu Trp Ala lie Cys Gin lie Arg Thr Thr Ala Asn Phe Ala Thr 
275 280 285 

Asn Asn Pro Phe Trp Asn Trp Phe Cys Gly Gly Leu Asn His Gin Val 
290 295 300 

Thr His His Leu Phe Pro Asn lie Cys His lie His Tyr Pro Gin Leu 
305 310 315 320 

Glu Asn lie lie Lys Asp Val Cys Gin Glu Phe Gly Val Glu Tyr Lys 

325 330 335 

Val Tyr Pro Thr Phe Lys Ala Ala lie Ala Ser Asn Tyr Arg Trp Leu 
340 345 350 

Glu Ala Met Gly Lys Ala Ser 
355 

4M) INFORMATION FOR SEQ ID NO: 3: 
2 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1884 base pairs 

% (B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

p. (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



AGCTTCACTT 


CGGTTTTATA 


TTGTGACCAT 


GGTTCCCAGG 


CATCTGCTCT 


AGGGAGTTTT 


60 


TCCGCTGCCT 


TTAGAGAGTA 


TTTTCTCCAA 


GTCGGCTAAC 


TCCCCCATTT 


TTAGGCAAAA 


120 


TCATATACAG 


ACTATCCCAA 


TATTGCCAGA 


GCTTTGATGA 


CTCACT6TAG 


AAGGCAGACT 


180 


AAAATTCTAG 


CAATGGACTC 


CCAGTTGGAA 


TAAATTTTTA 


GTCTCCCCCG 


GCGCTGGAGT 


240 


TTTTTTGTAG 


TTAATGGCGG 


TATAATGTGA 


AAGTTTTTTA 


TCTATTTAAA 


TTTATAAATG 


300 


CTAACAGCGG 


AAAGAATTAA 


ATTTACCCAG 


AAACGGGGGT 


TTCGTCGGGT 


ACTAAACCAA 


360 


CGGGTGGATG 


CCTACTTTGC 


CGAGCATGGC 


CTGACCCAAA 


GGGATAATCC 


CTCCATGTAT 


420 


CTGAAAACCC 


TGATTATTGT 


GCTCTGGTTG 


TTTTCCGCTT 


GGGCCTTTGT 


GCTTTTTGCT 


480 


CCA(5TTATTT 


TTCCGGTGCG 


CCTACTGGGT 


TGTATGGTTT 


TGGCGATCGC 


CTTGGCGGCC 


540 


TTTTCCTTCA 


ATGTCGGCCA 


CGATGCCAAC 


CACAATGCCT 


ATTCCTCCAA 


TCCCCACATC 


600 


AACCGGGTTC 


TGGGCATGAC 


CTACGATTTT 


GTCGGGTTAT 


CTAGTTTTCT 


TTGGCGCTAT 


660 


CGCCACAACT 


ATTTGCACCA 


CACCTACACC 


AATATTCTTG 


GCCATGACGT 


GGAAATCCAT 


720 
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CAGTACGTAT 


GAGTCCTGAA 


CAAGAACATG 


TTGGTATTTA 


TCGTTTCCAG 


780 




XXX V7\JV7\3 XXX 


fXX fHX Vmt X X X X \^ 


ATTCCCTTTT 


ATTGGTTTCT 


CTACGATGTC 


840 






CAAATATCAC 


GACCATAAAA 


TTCCTCCTTT 

^ ^ ^^^^ ^^^^ ^ 


CCAGCCCCTA 


900 




GTTT'GriTAGG 

V.7 X X X WW J- *X WW 


GATTAAGCTA 


TTATGGCTCG 

X X jn, X \,7V7W X V^V7 


GCTACGTTTT 


CGGCTTACCT 


960 


w X V7w\^ X v«> X VJ\7 


X X X X X 


TCCTGAAGTA 


TTAATTGGTG 


CTTCGGTAAC 


CTATATGACC 


1020 


TATGGCATCG 

X x^x w\7\.An X v^v7 


TGGTTTGCAC 


CATCTTTATG 


CTGGCCCATG 


TGTTGGAATC 


AACTGAATTT 


1080 




ATGGTGAATC 


CGGTGCCATT 


GATGACGAGT 


GGGCTATTTG 


CCAAATTCGT 


1140 


ACCACGGCCA 


ATTTTGCCAC 


CAATAATCCC 


TTTTGGAACT 


GGTTTTGTGG 


CGGTTTAAAT 


1200 


CACCAAGTTA 


CCCACCATCT 


TTTCCCCAAT 


ATTTGTCATA 


TTCACTATCC 


CCAATTGGAA 


1260 


f^TATTATTA 


AGGATGTTTG 


CCAAGAGTTT 


GGTGTGGAAT 


ATAAAGTTTA 


TCCCACCTTC 


1320 


[aaagcggcga 


TCGCCTCTAA 


CTATCGCTGG 


CTAGAGGCCA 


TGGGCAAAGC 


ATCGTGACAT 


_L J IJ W 


GCCTTGGGA TTGAAGCAAA ATGGCAAAAT CCCTCGTAAA TCTATGATCG AAGCCTTTCT 


1440 


^TTGCCCGCC 


GACCAAATCC 


CCGATGCTGA 


CCAAAGGTTG 


ATGTTGGCAT 


TGCTCCAAAC 


X •uf v u 


ICCACTTTGAG GGGGTTCATT 


GGCCGCAGTT 


TCAAGCTGAC 


CTAGGAGGCA 


AAGATTGGGT 


JL ^ \J \J 


^GATTTTGCTC 


AAATCCGCTG 


GGATATTGAA 


AGGCTTCACC 


ACCTTTGGTT 


TCTACCCTGC 


X VI ^ U 


#CAATGGGAA 


GGACAAACCG 


TCAGAATTGT 


TTATTCTGGT 


GACACCATCA 


CCGACCCATC 


1680 


xeATGTGGTCT 


AACCCAGCCC 


TGGCCAAGGC 


TTGGACCAAG 


GCCATGCAAA 


TTCTCCACGA 


1740 


GGCTAGGCCA 


GAAAAATTAT 


ATTGGCTCCT 


GATTTCTTCC 


GGCTATCGCA 


CCTACCGATT 


1800 


TTTGAGCATT 


TTTGCCAAGG 


AATTCTATCC 


CCACTATCTC 


CATCCCACTC 


CCCCGCCTGT 


1860 


ACAAAATTTT 


ATCCATCAGC 


TAGC 








1884 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1685 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
AATATCTGCC TACCCTCCCA AAGAGAGTAG TCATTTTTCA TCAATGGCTG CTCAAATCAA 60 
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GAAATACATT 


ACCTCAGATG 


AACTCAAGAA 


CCACGATAAA 


CCCGGAGATC 


TATGGATCTC 


120 


GATTCAAGGG 


AAAGCCTATG 


ATGTTTCGGA 


TTGGGTGAAA 


GACCATCCAG 


GTG6CAGCTT 


180 


TCCCTTGAAG 


AGTCTTGCTG 


GTCAAGAGGT 


AACTGATGCA 


TTTGTTGCAT 


TCCATCCTGC 


240 


CTCTACATGG 


AAGAATCTTG 


ATAAGTTTTT 


CACTGGGTAT 


TATCTTAAAG 


ATTACTCTGT 


300 


TTCTGAGGTT 


TCTAAAGATT 


ATAGGAAGCT 


TGTGTTTGAG 


TTTTCTAAAA 


TGGGTTTGTA 


360 


TGACAAAAAA 


GGTCATATTA 


TGTTTGCAAC 


TTTGTGCTTT 


ATAGCAATGC 


TGTTTGCTAT 


420 


GAGTGTTTAT 


GGG6TTTTGT 


TTTGTGAGGG 


TGTTTTGGTA 


CATTTGTTTT 


CTGGGTGTTT 


480 


GATGGGGTTT 


CTTTGGATTC 


AGAGTGGTTG 


GATTGGACAT 


GATGCTGGGC 


ATTATATGGT 


540 


^GTGTCTGAT 


TCAAGGCTTA 


ATAAGTTTAT 


GGGTATTTTT 


GCTGCAAATT 


GTCTTTCAGG 


600 


^I^ATAAGTATT 


GGTTGGTGGA 


AATGGAACCA 


TAATGCACAT 


CACATTGCCT 


GTAATAGCCT 


660 


^GAATATGAC 


CCTQATTTAC 


AATATATACC 


ATTCCTTGTT 


GTGTCTTCCA 


AGTTTTTTGG 


720 


pPTCACTCACC 


TCTCATTTCT 


ATGAGAAAAG 


GTTGACTTTT 


GAGTGTTTAT 


CAAGATTCTT 


780 


iTGTAAGTTAT 


CAACATTGGA 


CATTTTACCC 


TATTATGTGT 


GCTGCTAGGC 


TCAATATGTA 


840 


ffGTACAATCT 


CTCATAATGT 


TGTTGACCAA 


GAGAAATGTG 


TCCTATCGAG 


CTCAGGAACT 


900 


r^€TTGGGATGC 


CTAGTGTTCT 


CGATTTGGTA 


CCCGTTGCTT 


GTTTCTTGTT 


TGCCTAATTG 


960 


.rpGGTGAAAGA ATTATGTTTG TTATTGCAAG TTTATCAGTG ACTGGAATGC AACAAGTTCA 


1020 


&TTCTCCTTG 


AACCACTTCT 


CTTCAAGTGT 


TTATGTTGGA 


AAGCCTAAAG 


GGAATAATTG 


1080 


GTTTGAGAiiA 


CAAAC6GATG 


GGACACTTGA 


CATTTCTTGT 


CCTCCTTGGA 


TGGATTGGTT 


1140 


TCATGGTGGA 


TTGCAATTCC 


AAATTGAGCA 


TCATTTGTTT 


CCCAAGATGC 


CTAGATGCAA 


1200 


CCTTAGGAAA 


ATCTCGCCCT 


ACGTGATCGA 


GTTATGCAAG 


AAACATAATT 


TGCCTTACAA 


1260 


TTATGCATCT 


TTCTCCAAGG 


CCAATGAAAT 


GACACTCAGA 


ACATTGAGGA 


ACACAGCATT 


1320 


GCAGGCTAGG 


GATATAACCA 


AGCCGCTCCC 


GAAGAATTTG 


GTATGGGAAG 


CTCTTCACAC 


1380 


TCATGGTTAA 


AATTACCCTT 


AGTTCATGTA 


ATAATTTGAG 


ATTATGTATC 


TCCTATGTTT 


1440 


GTGTCTTGTC 


TTGGTTCTAC 


TTGTTGGAGT 


CATTGCAACT 


TGTCTTTTAT 


GGTTTATTAG 


1500 


ATGTTTTTTA 


ATATATTTTA 


GAGGTTTTGC 


TTTCATCTCC 


ATTATTGATG 


AATAAGGAGT 


1560 


TGCATATTGT 


CAATTGTTGT 


GCTCAATATC 


TGATATTTTG 


GAATGTACTT 


TGTACCACTG 


1620 


TGTTTTCAGT 


TGAAGCTCAT 


GTGTACTTCT 


ATAGACTTTG 


TTTAAATGGT 


TATGTCATGT 


1680 


TATTT 












1685 
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INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 448 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ala Ala Gin lie Lys Lys Tyr lie Thr Ser Asp Glu Leu Lys Asn 
15 10 15 

His Asp Lys Pro Gly Asp Leu Trp lie Ser lie Gin Gly Lys Ala Tyr 
20 25 30 

Asp Val Ser Asp Trp Val Lys Asp His Pro Gly Gly Ser Phe Pro Leu 
35 40 45 

Lys Ser Leu Ala Gly Gin Glu Val Thr Asp Ala Phe Val Ala Phe His 
50 55 60 

Pro Ala Ser Thr Trp Lys Asn Leu Asp Lys Phe Phe Thr Gly Tyr Tyr 
65 70 75 80 

Leu Lys Asp Tyr Ser Val Ser Glu Val Ser Lys Asp Tyr Arg Lys Leu 

85 90 95 

Val Phe Glu Phe Ser Lys Met Gly Leu Tyr Asp Lys Lys Gly His lie 
100 105 110 

Met Phe Ala Thr Leu Cys Phe lie Ala Met Leu Phe Ala Met Ser Val 
115 120 125 

Tyr Gly Val Leu Phe Cys Glu Gly Val Leu Val His Leu Phe Ser Gly 
130 135 140 

Cys Leu Met Gly Phe Leu Trp lie Gin Ser Gly Trp lie Gly His Asp 
145 150 155 160 

Ala Gly His Tyr Met Val Val Ser Asp Ser Arg Leu Asn Lys Phe Met 

165 170 175 

Gly lie Phe Ala Ala Asn Cys Leu Ser Gly lie Ser lie Gly Trp Trp 
180 185 190 

Lys Trp Asn His Asn Ala His His lie Ala Cys Asn Ser Leu Glu Tyr 
195 200 205 

Asp Pro Asp Leu Gin Tyr lie Pro Phe Leu Val Val Ser Ser Lys Phe 
210 215 220 
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Phe Gly Ser Leu Thr Ser His Phe Tyr Glu Lys Arg Leu Thr Phe Asp 
225 230 235 240 

Ser Leu Ser Arg Phe Phe Val Ser Tyr Gin His Trp Thr Phe Tyr Pro 

245 250 255 

lie Met Cys Ala Ala Arg Leu Asn Met Tyr Val Gin Ser Leu lie Met 
260 265 270 

Leu Leu Thr Lys Arg Asn Val Ser Tyr Arg Ala Gin Glu Leu Leu Gly 
275 280 285 

Cys Leu Val Phe Ser lie Trp Tyr Pro Leu Leu Val Ser Cys Leu Pro 
290 295 300 

Asn Trp Gly Glu Arg lie Met Phe Val lie Ala Ser Leu Ser Val Thr 
305 310 315 320 

Gly Met Gin Gin Val Gin Phe Ser Leu Asn His Phe Ser Ser Ser Val 

325 330 335 

Tyr Val Gly Lys Pro Lys Gly Asn Asn Trp Phe Glu Lys Gin Thr Asp 
340 345 350 

Gly Thr Leu Asp lie Ser Cys Pro Pro Trp Met Asp Trp Phe His Gly 
355 360 365 

Gly Ser Gin Phe Gin lie Glu His His Leu Phe Pro Lys Met Pro Arg 
370 375 380 

Cys Asn Leu Arg Lys lie Ser Pro Tyr Val lie Glu Leu Cys Lys Lys 
385 390 395 400 

His Asn Leu Pro Tyr Asn Tyr Ala Ser Phe Ser Lys Ala Asn Glu Met 

405 410 415 

Thr Leu Arg Thr Leu Arg Asn Thr Ala Leu Gin Ala Arg Asp lie Thr 
420 425 430 

Lys Pro Leu Pro Lys Asn Leu Val Trp Glu Ala Leu His Thr His Gly 
435 440 445 

INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
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Trp He Gly His Asp Ala Gly His 
1 5 

INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Asn Val Gly His Asp Ala Asn His 
1 5 

INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

val Leu Gly His Asp Cys Gly His 
1 5 

INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (peptide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 

Val He Ala His Glu Cys Gly His 
1 5 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val He Gly His Asp Cys Ala His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 11: 

,:1 (i) SEQUENCE CHARACTERISTICS: 

; (A) LENGTH: 8 amino acids 

r (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:11: 
Val val Gly His Asp Cys Gly His 

^2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

. (A) LENGTH: 5 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

His Asn Ala His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: eimino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (peptide) 



-66- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

His Asn Tyr Leu His His 
1 5 

INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

His Arg Thr His His 
1 5 

INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
His Arg Arg His His 



1 



5 



) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 

His Asp Arg His His 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: eimino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

His Asp Gin His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

His Asp His His His 
1 5 

32) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

His Asn His His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Phe Gin lie Glu His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Q His Gin Val Thr His His 

t: 1 5 

U2) INFORMATION FOR SEQ ID NO: 22: 

r; (i) SEQUENCE CHARACTERISTICS: 

jz (A) LENGTH: 5 amino acids 

1' (B) TYPE: amino acid 

U (D) TOPOLOGY: linear 

:f (ii) MOLECULE TYPE: Peptide 

* (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

His Val lie His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

His val Ala His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

His He Pro His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



His Val Pro His His 
1 5 



%) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
H (A) LENGTH: 1702 base pairs 

C (B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 
u (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 48.. 1406 

( ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 4 8.. 1406 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CCCCAAAAAT TTTCATTGTT CTCCATCTGG ACCACAGCAT CCACACA ATG GAG GGC 



1 



GAA GCT AAG AAG TAT ATC ACG GCG GAG GAC CTC CGC CGC CAC AAC AAG 
Glu Ala Lys Lys Tyr He Thr Ala Glu Asp Leu Arg Arg Hxs Asn Lys 
5 10 15 

TCC GGC GAT CTC TGG ATC TCC ATC CAG GGC AAG GTC TAG GAC TGC TCT 
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Ser Gly Asp Leu Trp lie Ser lie Gin Gly Lys Val Tyr Asp Cys Ser 
20 25 30 35 

CGG TGG GCG GCG GAG CAC CCC GGC GGC GAG GTC CCG CTC CTC AGT CTG 200 
Arg Trp Ala Ala Glu His Pro Gly Gly Glu Val Pro Leu Leu Ser Leu 

40 45 50 

GCG GGC CAG GAG GTC ACC GAC GCC TTC ATT GCG TAC CAC CCG GGC ACG 248 
Ala Gly Gin Asp Val Thr Asp Ala Phe lie Ala Tyr His Pro Gly Thr 
55 60 65 

GCG TGG CGG CAT CTG GAT CCG CTC TTC ACC GGC TAC TAC TAC CTC AAG 296 
Ala Trp Arg His Leu Asp Pro Leu Phe Thr Gly Tyr Tyr Tyr Leu Lys 
70 75 80 

GAC TTC GAA GTG TCG GAG ATC TCC AAG GAC TAC CGG AGG CTT TTG AAC 344 
Asp Phe Glu Val Ser Glu lie Ser Lys Asp Tyr Arg Arg Leu Leu Asn 
C 85 90 95 

mC ATG TCG CGG TCC GGG ATC TTC GAG AAG AAG GGC CAC CAC ATC ATG 392 
iGlu Met Ser Arg Ser Gly lie Phe Glu Lys Lys Gly His His lie Met 
400 105 110 115 

ITGG ACG TTC GTC GGC 6TT GCG GTC ATG ATG GCG GCA ATC GTC TAC GGC 440 
jTrp Thr Phe Val Gly Val Ala Val Met Met Ala Ala lie Val Tyr Gly 

120 125 130 

,;i3TG CTG GCG TCG GAG TCC GTC GGA GTT CAC ATG CTC TGC GGC GCA CTG 488 

{7al Leu Ala Ser Glu Ser Val Gly Val His Met Leu Cys Gly Ala Leu 

'l^ 135 140 145 

ftTG GGC TTG CTG TGG ATC CAA GCC GCG TAT GTG GGC CAT GAC TCC GGC 536 

"ieu Gly Leu Leu Trp lie Gin Ala Ala Tyr Val Gly His Asp Ser Gly 
150 155 160 

CAT TAC Gag GTG ATG CCA ACC CGT GGA TAC AAC A6A ATC ACG CAA CTC 584 
His Tyr Gin Val Met Pro Thr Arg Gly Tyr Asn Arg lie Thr Gin Leu 
165 170 175 

ATA GCA GGC AAC ATC CTA ACC GGA ATC AGC ATC GCG TGG TGG AAG TGG 632 
lie Ala Gly Asn lie Leu Thr Gly lie Ser lie Ala Trp Trp Lys Trp 
180 185 190 195 

ACC CAC AAC GCC CAC CAC CTC GCC TGC AAC AGC CTC GAC TAC GAC CCC 680 
Thr His Asn Ala His His Leu Ala Cys Asn Ser Leu Asp Tyr Asp Pro 

200 205 210 

GAC CTC CAG CAC ATC CCC GTA TTC GCC GTC TCC ACC CGA CTC TTC AAC 728 
Asp Leu Gin His lie Pro Val Phe Ala Val Ser Thr Arg Leu Phe Asn 
215 220 225 

TCC ATC ACC TCG GTC TTC TAT GGC CGA GTC CTG AAA TTC GAC GAA GTG 776 
Ser He Thr Ser Val Phe Tyr Gly Arg Val Leu Lys Phe Asp Glu Val 
230 235 240 
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GCA CGG TTC CTA GTC AGC TAG CAG CAC TGG ACC TAG TAG CCG GTG ATG 824 
Ala Arg Phe Leu Val Ser Tyr Gin His Trp Thr Tyr Tyr Pro Val Met 
245 250 255 

ATG TTG GGG GGA GTG AAG GTG TTG ATG GAG AGG TTT TTA TTG GTG GTG 872 
lie Phe Gly Arg Val Asn Leu Phe lie Gin Thr Phe Leu Leu Leu Leu 
260 265 270 275 

AGG AGG GGG GAG GTG GGT GAG GGG GGT GTA AAG TTA ATG GGT ATG GGG 920 
Thr Arg Arg Asp Val Pro Asp Arg Ala Leu Asn Leu Met Gly lie Ala 

280 285 290 

GTT TTG TGG AGG TGG TTG GGG GTG TTG GTA TGT TGT GTG GGG AAG TGG 968 
Val Phe Trp Thr Trp Phe Pro Leu Phe Val Ser Gys Leu Pro Asn Trp 
295 300 305 

gCT GAA GGG TTG GGG TTG GTG GTG ATG AGG TTT GGG GTG AGG GGG ATG 1016 
&0 Glu Arg Phe Gly Phe Val Leu lie Ser Phe Ala Val Thr Ala lie 
K 310 315 320 

teAG GAG GTG GAG TTG AGG GTG AAG GAG TTG TGG GGG GAG AG A TAG GTG 1064 
Mln His Val Gin Phe Thr Leu Asn His Phe Ser Gly Asp Thr Tyr Val 
325 330 335 

^PGG GGG GGG AAG GGG GAG AAG TGG TTG GAG AAG GAG AGG AAA GGG AGG 1112 
^ly Pro Pro Lys Gly Asp Asn Trp Phe Glu Lys Gin Thr Lys Gly Thr 
|g40 345 350 355 

XtG GAT ATG AGG TGG GGA GGG TGG ATG GAG TGG TTG TTT GGT GGG GTG 1160 
'ile Asp lie Thr Gys Pro Pro Trp Met Asp Trp Phe Phe Gly Gly Leu 

360 365 370 

GAG TTG GAG TTG GAG GAG GAG TTG TTG GGT AGG GTG GGG GGT GGG GAG 1208 
Gin Phe Gin Leu Glu His His Leu Phe Pro Arg Leu Pro Arg Gly Gin 
375 380 385 

GTT AGG AAG ATT GGG CGG TTG GGT GGG GAG TTG TGT AAG AAG GAG GGG 1256 
Leu Arg Lys lie Ala Pro Leu Ala Arg Asp Leu Gys Lys Lys His Gly 
390 395 400 

ATG GGG TAT AGG AGG TTG GGG TTT TGG GAG GAG GGT AAT GTC AGG AGA 1304 
Met Pro Tyr Arg Ser Phe Gly Phe Trp Asp Asp Ala Asn Val Arg Thr 
405 410 415 

ATT GGG AGG GTG AGG GAT GGG GGG GTT GAG GCG GGT GAG GTT AAT TGG 1352 
lie Arg Thr Leu Arg Asp Ala Ala Val Gin Ala Arg Asp Leu Asn Ser 
420 425 430 435 

GCG' GGG TGG GGT AAG AAA GTT GGG TAT GGG GAA GGT TAT AAG ACC CAT 1400 
Ala Pro Gys Pro Lys Lys Leu Gly Tyr Gly Glu Ala Tyr Asn Thr His 

440 445 450 

GGT TGA TTGTGGTTTT GTGTTGTGGG TTGGAGGATG TTGTTATTAT TGATTTATGT 1456 
Gly * 



-72- 



CCACAATATT GAACTGAATA ACCATGGAAG GCACTACGTT CAGCTTAACT TTGCTTAACT 1516 

TTGCTAGCTG GTTGCGTTCC CTTGTTGGGG GCAAAGTGCA GTATTTATTT TCTTATCCCA 1576 

TGTACTTTTT GATTATTGTT CTTATTCGTA TCATAAATAA TTTATTATTG ATTAATTTTT 1636 

GTTGTAGTTG GGTGTCTATA GCAAGTTTAT AATACTGAGA TATATTTTTT TGGTAAAAAA 1696 

AAAAAA * 1702 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

^ (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

34et Glu Gly Glu Ala Lys Lys Tyr lie Thr Ala Glu Asp Leu Arg Arg 

j-: 1 5 10 15 

%is Asn Lys Ser Gly Asp Leu Trp lie Ser lie Gin Gly Lys Val Tyr 
20 25 30 

&sp Cys Ser Arg Trp Ala Ala Glu His Pro Gly Gly Glu Val Pro Leu 
35 40 45 

#eu Ser Leu Ala Gly Gin Asp Val Thr Asp Ala Phe lie Ala Tyr His 
V 50 55 60 

Pro Gly Thr Ala Trp Arg His Leu Asp Pro Leu Phe Thr Gly Tyr Tyr 
65 70 75 80 

Tyr Leu Lys Asp Phe Glu Val Ser Glu lie Ser Lys Asp Tyr Arg Arg 

85 90 95 

Leu Leu Asn Glu Met Ser Arg Ser Gly lie Phe Glu Lys Lys Gly His 
100 105 110 

His lie Met Trp Thr Phe Val Gly Val Ala Val Met Met Ala Ala lie 
115 120 125 

Val Tyr Gly Val Leu Ala Ser Glu Ser Val Gly Val His Met Leu Cys 
130 135 140 

Gly Ala Leu Leu Gly Leu Leu Trp lie Gin Ala Ala Tyr Val Gly His 
145 150 155 160 

Asp Ser Gly His Tyr Gin Val Met Pro Thr Arg Gly Tyr Asn Arg lie 

165 170 175 
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Thr Gin Leu lie Ala Gly Asn lie Leu Thr Gly lie Ser lie Ala Trp 
180 185 190 

Trp Lys Trp Thr His Asn Ala His His Leu Ala Cys Asn Ser Leu Asp 
195 200 205 

Tyr Asp Pro Asp Leu Gin His lie Pro Val Phe Ala Val Ser Thr Arg 
210 215 220 

Leu Phe Asn Ser lie Thr Ser Val Phe Tyr Gly Arg Val Leu Lys Phe 
225 230 235 240 

Asp Glu Val Ala Arg Phe Leu Val Ser Tyr Gin His Trp Thr Tyr Tyr 

245 250 255 

Pro Val Met lie Phe Gly Arg Val Asn Leu Phe lie Gin Thr Phe Leu 
260 265 270 

tCeu Leu Leu Thr Arg Arg Asp Val Pro Asp Arg Ala Leu Asn Leu Met 

■U 275 280 285 

Mly lie Ala Val Phe Trp Thr Trp Phe Pro Leu Phe Val Ser Cys Leu 

fi; 290 295 300 

JI>ro Asn Trp Pro Glu Arg Phe Gly Phe Val Leu lie Ser Phe Ala Val 

,305 310 315 320 

^hr Ala lie Gin His Val Gin Phe Thr Leu Asn His Phe Ser Gly Asp 

n. 325 330 335 

'jThr Tyr Val Gly Pro Pro Lys Gly Asp Asn Trp PLe Glu Lys Gin Thr 
^; 340 345 350 

Lys Gly Thr lie Asp lie Thr Cys Pro Pro Trp Met Asp Trp Phe Phe 
355 360 365 

Gly Gly Leu Gin Phe Gin Leu Glu His His Leu Phe Pro Arg Leu Pro 
370 375 380 

Arg Gly Gin Leu Arg Lys lie Ala Pro Leu Ala Arg Asp Leu Cys Lys 
385 390 395 400 

Lys His Gly Met Pro Tyr Arg Ser Phe Gly Phe Trp Asp Asp Ala Asn 

405 410 415 

Val Arg Thr He Arg Thr Leu Arg Asp Ala Ala Val Gin Ala Arg Asp 
420 425 430 

Leu Asn Ser Ala Pro Cys Pro Lys Lys Leu Gly Tyr Gly Glu Ala Tyr 
435 440 445 

Asn Thr His Gly * 
450 
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WHAT IS CIAIMSP: 

1 

1. An isolated nucleic acid encoding an 
evening primrose A6 -desaturase. 

5 2 . The isolated nucleic acid of Claim 1 

comprising at least one of the nucleotide sequence of 
SEQ ID NO: 26 or nucleotides 49 to 1401 of SEQ ID NO: 
26. 

10 3 . An isolated nucleic acid that codes for 

the amino acid sequence of SEQ ID NO: 27. 

4. A vector comprising the nucleic acid of 
any one Claims 1- 3, 

15 

5. An expression vector comprising the 
isolated nucleic acid of any one of Claims 1-3 operably 
linked to a promoter which effects expression of the 
gene product of said isolated nucleic acid. 

20 

6. An expression vector comprising the 
isolated nucleic acid of any one of Claims 1-3 operably 
linked to a promoter and a termination signal capable 
of effecting expression of the gene product of said 

25 isolated nucleic acid. 

7. The expression vector of Claim 5 wherein 
said promoter is a A6- desaturase promoter, an Anabaena 

30 



35 
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carboxylase promoter, a helianthinin promoter, a 
1 glycinin in promoter, a napin promoter, the 35S 
promoter from CaMV, a helianthinin tissue -specific 
promoter, an oleosin seed- specific promoter, or an 
albiimin seed- specif ic promoter, 

5 

8, The expression vector of Claim 6 wherein 
said promoter is a A6 -desaturase promoter, an iynabaena 
carboxylase promoter, a helianthinin promoter, a 
glycinin promoter, a napin promoter, the 35S promoter 
10 from CaMV, a helianthinin tissue- specif ic promoter, an 
oleosin seed- specif ic promoter, or an albiimin seed- 
specific promoter. 



9. An expression vector comprising the 

15 isolated nucleic acid of any one of Claims 1-3 operably 
linked to a consitutive promoter. 

10. An expression vector comprising the 
isolated nucleic acid of any one of Claims 1-3 operably 

20 linked to a tissue specific promoter. 



11. The expression vector of Claim 6 wherein 
said termination signal is a S vnechocvs t i s termination 
signal, a nopaline synthase termination signal, or a 

25 seed termination signal. 

12. A cell comprising the vector of Claim 4. 



30 
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13, A cell comprising the vector of Claim 5 

14, A cell comprising the vector of Claim 6 



15. The cell of Claim 12 wherein said cell 
5 is an animal cell, a bacterial cell, a plant cell or a 
fungal cell. 



16. The cell of Claim 13 wherein said cell 
is an animal cell, a bacterial cell, a plant cell or a 

10 fungal cell. 

17. The cell of Claim 14 wherein said cell 
is an animal cell, a bacterial cell, a plant cell or a 
fungal cell. 

18. A transgenic bacterium or plant 
comprising the isolated nucleic acid of any one of 
Claims 1-3. 



20 19, A transgenic bacterium or plant 

comprising the vector of Claim 4. 

20. A transgenic bacterium or plant 
comprising the vector of Claim 5. 

25 

21, A transgenic bacterium or plant 
comprising the vector of Claim 6 . 



30 



35 



-77- 



22. A plant or progeny of said plant which 
1 has been regenerated from the plant cell of Claim 15. 

23. The plant of Claim 22 wherein said plant 
is a sunflower, soybean, maize, tobacco, peanut, carrot 

5 or oil seed rape plant. 

24. A method of producing a plant with 
increased gamma linolenic acid (GLA) content which 
comprises: 

10 (a) transforming a plant cell with the 

isolated nucleic acid of any one of Claims 1-3; and 

(b) regenerating a plant with increased GLA 
content from said plant cell. 

15 25. A method of producing a plant with 

increased gamma linolenic acid (GLA) content which 
comprises: 

(a) transforming a plant cell with the vector 
of Claim 4; and 
20 (b) regenerating a plant with increased GLA 

content from said plant cell. 

26. A method of producing a plant with 
increased gamma linolenic acid (GLA) content which 
25 comprises: 

(a) transforming a plant cell with the vector 
of Claim 5; and 



30 
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(b) regenerating a plant with increased GLA 
1 content from said plant cell. 

27, A method of producing a plant with 
increased gamma linolenic acid (GLA) content which 

5 comprises : 

(a) transforming a plant cell with the vector 
of Claim 6; and 

(b) regenerating a plant with increased GLA 
content from said plant cell. 

10 

28, The method of Claim 24 wherein said 
plant is a sunflower, soybean, maize, tobacco, peanut, 
carrot or oil seed rape plant* 

15 29 . The method of Claim 25 wherein said 

plant is a sunflower, soybean, maize, tobacco, peanut, 
carrot or oil seed rape plant. 

30. The method of Claim 26 wherein said 

20 plant is a sunflower, soybean, maize, tobacco, peanut, 
carrot or oil seed rape plant. 

31. The method of Claim 27 wherein said 
plant is a sunflower, soybean, maize, tobacco, peanut, 

25 carrot or oil seed rape plant. 

32. A method of inducing or increasing 
production of gamma linolenic acid (GLA) in an organism 



30 
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lacking in or producing low levels of GLA which 
1 comprises transforming said organism with the isolated 
nucleic acid of any one of Claims 1-3. 

33. A method of inducing or increasing 
5 production of gamma linolenic acid (GLA) in an organism 
deficient or lacking in or producing low levels of GLA 
which comprises transforming said organism with the 
vector of Claim 4 • 

10 34. A method of inducing or increasing 

production of gamma linolenic acid (GLA) in an organism 
deficient or lacking in or producing low levels of GLA 
which comprises transforming said organism with the 
vector of Claim 5 . 

15 

35. A method of inducing or increasing 
production of gamma linolenic acid (GLA) in an organism 
deficient or lacking in or producing low levels of GLA 
which comprises transf oirming said organism with the 

20 vector of Claim 6. 

36. A method of inducing production of gamma 
linolenic acid (GLA) in an organism deficient or 
lacking in or producing low levels of GLA and linoleic 

25 acid (LA) which comprises transforming said organism 
with an isolated nucleic acid encoding bacterial a6- 
desaturase and an isolated nucleic acid encoding a12- 
desaturase. 

30 
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37. A method of inducing production of gamma 
1 linolenic acid (GLA) in an organism deficient or 

lacking in or producing low levels of GLA and linoleic 
acid (LA) which comprises transforming said organism 
with at least one expression vector comprising an 
5 isolated nucleic acid encoding evening primrose a6- 
desaturase and an isolated nucleic acid encoding a12- 
desaturase. 

38. The method of inducing production of 
10 octadecatetraeonic acid in at least one of a plant 

deficient or lacking in or producing low levels of 
octadecatetraenoic acid, a bacteritim which produces a- 
linolenic acid, or a bacterium which exhibits a A15- 
desaturase activity on a GLA substrate which comprises 
15 transforming said plant or bacterium with any one of 
Claims 1-3. 

39. A method of inducing production of 
octadecatetraeonic acid in at least one of a plant 

20 deficient or lacking in or producing low levels of 

octadecatetraenoic acid, a bacterixim which produces a- 
linolenic acid, or a bacterium which exhibits a A15- 
desaturase activity on a GLA substrate which comprises 
transforming said plant or bacterium with the vector of 

25 Claim 4. 

40. A method of inducing production of 
octadecatetraeonic acid in at least one of a plant 

30 
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deficient or lacking in or producing low levels of 
1 octadecatetraenoic acid, a bacterium which produces a- 
linolenic acid, or a bacterium which exhibits a A15- 
desaturase activity on a GLA substrate which comprises 
transforming said plant or bacterixim with the vector of 
5 Claim 5 . 

41* A method of inducing production of 
octadecatetraeonic acid in at least one of a plant 
deficient or lacking in or producing low levels of 
10 octadecatetraenoic acid, a bacterium which produces a- 
linolenic acid, or a bacterium which exhibits a A15- 
desaturase activity on a GLA sxibstrate which comprises 
transforming said plant or bacterium with the vector of 
Claim 6* 

15 

42* A method of inducing production of 
octadecatetraeonic acid in at least one of a plant 
deficient or lacking in or producing low levels of 
octadecatetraenoic acid, a bacterium which produces a- 
20 linolenic acid, or a bacterium which exhibits a A15- 
desaturase activity on a GLA sxibstrate which comprises 
transforming said plant or bacterium with the vector of 
Claim 7. 

25 43. The method of Claim 40 wherein said 

plant is a sunflower, soybean, maize, tobacco, peanut, 
carrot or oil seed rape plant. 
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44. The method of Claim 41 wherein said 

1 plant is a sunflower, soybean, maize, tobacco, peanut, 
carrot or oil seed rape plant. 

45. The method of Claim 42 wherein said 

5 plant is a sunflower, soybean, maize, tobacco, peanut, 
carrot or oil seed rape plant. 

46. The method of Claim 43 wherein said 
plant is a sunflower, soybean, maize, tobacco, peanut, 

10 carrot or oil seed rape plant. 



15 



20 



25 
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ABS'PRAOT QP THE DISCLOSURE 

1 

Linoleic acid is converted into y-linolenic 
acid by the enzyme A6-desaturase, The present 
invention is directed to isolated nucleic acids 
5 comprising the A6 -desaturase gene. More particularly, 
the isolated nucleic acid comprises the promoter, 
coding region and termination regions of the A6 - 
desaturase gene. The present invention provides 
recombinant constructions comprising the A6- desaturase 
10 coding region in functional combination with 

heterologous regulatory sequences. The nucleic acids 
and recombinant constructions of the instant invention 
are useful in the production of GLA in transgenic 
organisms . 
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Pig. 9 



Cmnplete DNA sequence and deduced amino add sequence of 
Evraiag Primrose putative A^-desatnrase 



OOCXaUUUU Wi T l 'l lJ a' il/ri l LnUJ^^ GMS 09C GKk 

M E G E 

OCT AAG AAG tAT ASC ACG OOS GAG GN2 CTC OGC OGC CTtfT AAC AA6 TCC GGC GM CTC TOG 
AKKYITAEDLRRHNKSGDLH 
ATC TOC ATC ens GGC MG GXC TAC GAC GTC tCT COG tGG GC3G GCG GAG CAC OCT GGC GGC 
IS XQGKV. YDVSRWAAB n 9 a a 
GAG GTC OCXS CTC CTC OTG CTG GOC GGC CAO GAC GXC AOC GAC GOC TTC ATT GCG T»C CAC 
BVPL LMLAGQDVTDAFIAYH 
OCXS GGC ACQ GC30 TGG COO OBT CTG GAT OCSO CTC TTC AOC GGC TAC TAC CTC AA6 GAC TTC 
PGTAWRHI.DPLFTGYYI.KDP 
GAA OIG TOG GAG ATC TOC AAG GAC TAC 006 AGO CTT TPS AAC CAG AIG TOG COG TOC GOO 
EVSBISKDYRRLZ.HBHSRSG 
ATC TTC GAG AAG AAG GOC CAC CAC ATC ATQ TOG AOG TTC GXC GOC GTT GOO GTC ATG ATG 
IFBKKGHHIMWTFVGVAVMM 
GOO OCA Ate GTC TAC GOC OIG CTG GOG TOO GAG TOC GTC GQA GTT CAC ATG CTC TOC GOC 
AAIVY GVLASBSVGVHMLCG 
OCA CTG CTG GOC TTO CTG TOO ATC CAA GOC GOO TAT GTO GOC CAT GAC TOC GOC CAT TAC 
ALliGXiLWIQAAYV Q H D g G H y 
CAG GTG ATO OCA AOC OOT GGA TAC AAC AGA ATC AOG CAA CTC ATA OCA GOC AAC ATC OTA 
QVMPTRGYHRITQI.IAGMII. 
ACC GGA ATC AOC ATC GOO TOG TOO AAG TOG ACC CAC AAC GOC CAC CAC CTC GOC TOC AAC 

TGISIAWWKWT H K A H H I. A C N 

AGO CTC GAC TAC GAC OOC GAC CTC CAO CAC ATC OOC GIA TTC GOC GTC TOC ACC OOA CTC 
SLDYDPDI.QHIPVFAVSTRI, 
TIC AAC TCC ATC AOC TOO GTC TTC TKT GOC OOA GTC CTG AAA TTC GAC GAA GTG OCA 000 
FNSXT5VFYGRVI«KFDBVAR« 
TTC CTA GTC AOC TAC CAG CAC TOG AOC TAC TAC 000 GTC ATO XIC TtC GOC OOA GTC AAC 
PLVSYQHWTYYPVMIPGRVN 
CTC TTC ATC CAG AOC TTT TTA TTO CTC CTC AOC AGO OGC GAC GTC OCT GAC OOC GOP CTA 
LFIQTFbr^tLTRRDVPDRAI* 
AAC TTA ATG GOT ATC GOO GTT TTC TOG AOG TOO TTC COG CTC TTC OTA TCT TGT CTC COG 
M,I-MGX AVFIfTWFPI.FVSCl*P 
AAC TOO OCT GAA 000 TTC GOO TTC GTC CTC ATC AGC TTT GOG GTC AOG GOO ATC CAG CAC 
NWPERFGFVItlSFAVTAIQH 
GTC CAG TTC ACO CTC AAC CM: TTC TOC GOC GAC ACA TAC GTO GOC OOC OOC AAG GOC GAC 
VQFTltNHFSGDTYVG^P PKGD 
AAC TOG TTC GAG AAG CAG AGO AAA GOO ACQ ATC GAT ATC AOO TOO OCA COO TOO ATG GAC 

HWP BKQTKG TXDX TCPPHMD 
TOO TTC TTT GOT GOO CTG GAG TTC CAO TIG GAG CAC CAC TTO TTC OCT AGO CTG COO COT 

W F F G O L 6 Z Q ii g H H L F P R I. P R 

GOO CAO CTT AGO AAG ATT GOO OOC TTO OCT COO GAC TTO TOT AAG AAG CAC 000 ATO COO 
GQLRKXAPLARDX.CKKHGMP 
TATAGOAGCTTCGQOTTTTQOGACGCTAATGTCAGO ACA ATT 000 ACQ CTG AGO GAT GOG 
YRSPGPWDAHVRT XRT IiRDA 
GOG OPT CAG GOO OOT GAC CTT AAT TOG GOC COO TGC OOP AAG AAA CTT GOO TAT GOO GAA 

AVQARDL NSA PCPKKLGYGE 

OCT TAT AAC AOC CAT GOT TGA TTC TOG TTT TOT OPT GTO GOP TOO AGO ATC TIC TTA TTA 

AYHTHG* 

TTOAx'x^a:A WW^CAATATIGAACTGAAy^^ 

CU,^iwxuvK *^30 CAAAGTOCAtf AAVri!A !l T riV'i^ ^^ 

AAAAAAAAAA 
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TH|»r«cl ouipu-t for Bor«g* B»lt« fi-d^sa-fcuras* 
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THpr«d output for Eu^nlna prl»ro»» svq 
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